(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT cboPERATION TREATY (PCT) 



AZ. 



(19) World Intellectual Property Organization 

International Bureau 

i 

(43) International Publication Date 
29 March 2001 (29.03.2001) 




PCT 




s ■ 



(10) International Publication Number 

WO 01/21772 A2 



(51) International Patent Classification 7 : 



C12N9/00 



(21) International Application Number: PCI7US00/25993 



(22) International Filing Date: 

21 September 2000 (21.09.2000) 



(25) Filing Language: 

(26) Publication Language: 



English 

i 

English 



(30) Priority Data: 

09/400,494 
60/210,072 
60/221,836 
60/227,860 



2 1 September 1 999 (2 1 .09. 1 999) US 

7 June 2000 (07.06.2000) US 

28 July 2000 (28.07.20b0) US 

24 August 2000 (24.08.2000) US 



(71) Applicant: OMNIGENE BIOPRODUCTS [US/US]; 
763(d) Concord Avenue, Cambridge, MA 02138 (US). 

(72) Inventors: YOCUM, Rogers, R.; 4 Orchard Lane, Lex- 
ington, MA 02420 (US). PATTERSON, Thomas, A.; 89 



Church Street, North Attleboro, MA 02760 (US)1 HER- 
MANN, Theron; 1 8 Chjlhowie Drive, Kinnelon, NJ 07405 
1 (US). PERO, Janice, G.t 20 Solomon Pierce Road, Lex- 
' ington, MA 02420 (US). 

^ i 

t f 

(74) Agents: HANLEY, Elizabeth, A. et al.; Lahive & Cock- 
field, LLP, 28 State Street, Boston, MA 02109 (US). 

i 

i ' 
(81) Designated States (national): AU, BR, CA, CN t CZ, HU, 
ID, IL, IN, JP, KR, LT, LV, MX, NO, PL, RU, SG, SK, TR, 
UA, ZA. 

(84) Designated States (regional): European patent (AT, BE, 
CH, CY, DE. DK, ES ; H. FR t GB. GR. IE. FT LIL MC 
NL, FT, SE). 

Published: 

— Without international search report and to be republished 
upon receipt of that report. 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



^ (54) Title: METHODS AND MICROORGANISMS FOR PRODUCTION OF PANTO-COMPOUNDS 

C*n| (57) Abstract: The present invention features methods of producing panto-compounds (e.g., pantothenate) using microorganisms 
in which the pantothenate biosynthetic pathway and/or the isoleucine-valine biosynthetic pathway and/or the coenzymeA biosyn- 
thetic pathway has been manipulated. Methods featuring ketopantoate reductase overexpressing microorganisms as well as aspartate 
a-decarboxylase overexpressing microorganisms are provided. Methods of producing panto-compounds in a precursor-independent 
manner and in high yield are described- Recombinant microorganisms, vectors, isolated nucleic acid molecules, genes and gene 
products useful in practicing the above methodologies are also provided. The present invention also features a previously unidenti- 
fied microbial pantothenate kinase gene, coaX, as well as methods of producing panto-compounds utilizing microorganisms having 
modified pantothenate kinase activity. Recombinant microorganisms, vectors, isolatedcoaA' nucleic acid molecules and purified 
CoaX proteins are featured. Also featured are methods for identifying pantothenate kinase modulators utilizing the recombinant 
microorganisms and/or purified CoaX proteins of the present invention. 



^SDOCID: < WO 0 1 21 772A2 J_> 



* 

. WO 01/21772 PCT/US00/25993 

« 

1 . - i - 

• ■ . > ■ 

METHODS AND MICROORGANISMS FOR 
PRODUCTION OF PANTO-COMPOUNDS 
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Background of the Invention 

5 Pantothenate, also known as pantothenic acid or vitamin B5, is a member of the 

B complex of vitamins and is a nutritional requirement for mammals, including 
livestock and humans (e.g., from food sources, as a water soluble vitamin supplement or 
as a feed additive). In cells, pantothenate is used primarily for the biosynthesis of 
coenzyme A (CoA) and acyl carrier protein (ACP). These coenzymes function in the 

10 metabolism of acyl moieties which form thioesters with the sulfhydryl group of the 4'- 
phosphopantetheine portion of these molecules. These coenzymes are essential in all 
cells, participating in over 100 different intermediary reactions in cellular metabolism. 

The conventional means of synthesizing pantothenate (in particular, the bioactive 
D isomer) is via chemical synthesis from bulk chemicals, a process which is hampered 

15 by excessive substrate cost as well as the requirement for optical resolution of racemic 
intermediates (e.g., resolution of DL-pantolactone to obtain D-pantolactone for chemical 
condensation with P-alanine). Accordingly, researchers have recently looked to 
bacterial or microbial systems that produce enzymes useful in pantothenate biosynthesis 
processes (as bacteria are themselves capable of synthesizing pantothenate). In 

20 particular, bioconversion processes have been evaluated as a means of favoring 
production of the D isomer of pantothenic acid, e.g., using microorganisms which 
selectively hydrolyze a DL-pantothenic acid ester to D-pantothenic acid; 
microorganisms which selectively decompose L-pantolactone resulting in D- 
pantolactone alone; and microorganisms which selectively hydrolyze DL-pantolactone 

25 to D-pantoic acid. 

There is still, however, significant need for improved pantothenate production 
processes, in particular, for processes requiring reduced quantities of substrates and/or 
less expensive substrates. To this end, methods of direct microbial synthesis have 
recently been examined as a means of improving D-pantothenate production. In 

30 microbes, pantothenate biosynthetis is a multistep pathway resulting in condensation of 
pantoate (derived from a-ketoiso valerate) and p-alanine to form D-pantothenate. The 
isoleucine-valine (/7v) pathway biosynthetic enzymes, acetohydroxyacid synthetase (the 
HvBNot alsS gene product), acetohydroxyacid isomeroreductase (the ilvC gene product) 
and dihydroxyacid dehydratase (the ilvD gene product) catalyze the conversion of 

35 pyruvate to a-ketoisovalerate. The reactions are further catalyzed by the pantothenate 
(pan) pathway biosynthetic enzymes ketopantoate hydroxymethyltransferase (the panB 
gene product), ketopantoate reductase (the panE gene product), aspartate-a- 
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.decarboxylase (the panD gene product) and pantqthenate synthetase (the panC gene 

product). ( • 1 

The genes encoding the enzymes involved in the biosynthesis of pantothenic acid 
in Salmonella typhimurium and Escherichia coli have recently been identified and 
5 characterized (Frodyma and Downs (1998) J. Biol Chem. 273:5572-5576 and 

Jackowski (1996) pp. 687-694, /nNeidhardt et al (ed.) Escherichia coli and Salmonella: 
Cellular and Molecular Biology, ^ ed. Am. Soc. Microbiol. Wash, D.C). In E. coli 9 for 
example, the biosynthesis of pantothenic acid consists of four key steps. The first 
reaction is catalyzed by the panB gene product, ketopantoate hydroxymethyltransferase, 

10 and uses the L-valine intermediate a-ketoiso valerate to generate ketopantoate, which is 
subsequently reduced to pantoate by the panE gene product, ketopantoate reductase. 
The panD gene product, aspartate-a-decarboxylase, generates P-alanine from aspartate. 
The panC gene product, pantothenate synthetase, subsequently ligates p-alanine with 
pantoate to yield D-pantothenate. J , 

15 The authors Dusch et aL described the identification of the Corynebacierium 

glutamicum panD gene and reported that expression of the C glutamicum panD gene in 
E. coli yielded a strain producing pantothenate with a specific productivity of 140 ng of 
pantothenate per mg (dry weight) per hour. (Dusch et al (1999) Appl Environ. 
Microbiol 65:1530-1539). 

20 The authors Sahm and Eggeling have further identified the Corynebacierium 

glutamicum panB and pan C genes and have described a genetically engineered strain of 
C glutamicum which overexpresses the panBC genes (Sahm and Eggeling (1999) Appl 
Environ. Microbiol 65:1973-1979). The engineered strain produces pantothenate, 
however, it was necessary to overexpress the genes responsible for a-ketoisovalerate 

25 production in the host organism in order that pantothenic acid production could be 
detected. Moreover, without the addition of p-alanine, no substantial amounts of 
pantothenate accumulated with the strain constructed. 

Likewise, a method of producing D-pantothenic acid has been described that 
takes advantage of a sodium salicylate resistant mutant strain of E. coli which produces 

30 D-pantothenic acid when cultured in the presence of P-alanine (U.S. Patent No. 

5,5 1 8,906). Generation of E* coli strains resistant to a-ketoisovaleric acid and/or ct- 
ketobutyric acid, and/or a-aminobutyric acid, and/or p-hydroxyaspartic acid and/or O- 
methyl-threonine, in addition to salicylic acid, further increased pantothenic acid 
production. Moreover, transformation of a plasmid DNA carrying the panB,panC and 

35 panD genes into the salicylic acid resistant mutant strain resulted in increased 
pantothenate production, however, up to 20 g/L p-alanine or more was fed in the 
examples given. The panB-panC-panD genes are clustered on the E. coli chromosome. 
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Finally, a method of producing D-pantothenic acid has been described which 
utilizes a salicylic acid-resistant, a-ketoisovalerate-resistant, a-ketobiityrate-resistant, p- 
hydroxyaspartate-resistant, o-methylthreonine-resistent E. coli strain transformed with 
pantothenate biosynthesis gene-containing DNA fragments and/or branched amino acid 
5 biosynthesis gene-containing DNA fragments and cultured in the presence of p-alanine 
(U.S. Patent No. 5,932,457). ' , t . , 

Pantothenate production in bacteria results from the condensation of pantoate 
and P-alanine and involves the pantothenate biosynthetic enzymes ketopantoate 
hydroxymethyltransferase (the panB gene product), ketopantoate reductase (the panE 

10 gene product), aspartate-a-decarboxylase (the panD gene product) and pantothenate 
synthetase (the panC gene product). Although pantothenate is biologically active as a 
vitamin, it is further metabolized in all cells to Coenzyme A (CoA) which participates as 
an acyl group carrier in the tricarboxylic acid (TCA) cycle, ifatty acid metabolism and 
numerous other reactions of intermediary metabolism. The initial (and possibly rate- 

15 controlling) step in the conversion of pantothenate to Coenzyme A (CoA) is 

phosphorylation of pantothenate by pantothenate kinase. A pantothenate kinase activity 
was first identified in Salmonella typhimurium by screening for temperature-sensitive 
mutants which synthesized CoA at permissive temperatures but excreted pantothenate at 
non-permissive temperatures. The mutations were mapped in the Salmonella 

20 chromosome and the genetic locus was designated coaA. The gene encodes the enzyme 
that catalyzes the first step in the biosynthesis of coenzyme A from pantothenate (Dunn 
and Snell (1979) J. BacterioL 140:805-808). Escherichia coli temperature sensitive 
mutants have also been isolated and characterized (Vallari and Rock (1987) J. BacterioL 
169:5795-5800). These mutants (named coaA15(Ts)) are defective in the conversion of 

25 pantothenate to CoA and further exhibit a temperature-sensitive growth phenotype, 
indicating that pantothenate kinase activity is essential for growth. Moreover, it was 
noted that CoA inhibited pantothenate kinase activity to the same degree in the mutant 
as compared to the wild-type enzyme. 

Feedback resistant E. coli mutants (named coaAJ6(Fr)) have also been isolated 

30 that posses a pantothenate kinase activity that is refractory to feedback inhibition by 
CoA (Vallari and Jackowski (1988) J. BacterioL 170:3961-3966). The mutation 
responsible for the reversion is, suprisingly, not genetically linked to the coaA gene by 
transduction. Additional data described therein support the view that the total cellular 
CoA content is controlled by both modulation of biosynthesis at the pantothenate kinase 

35 step and possibly by degradation of CoA to 4*-phosphopantetheine. 
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The wild-type E. coli coaA gene, was cloned by .functional complementation of , 
E. coli temperature-sensitive mutants. The sequence of the wild-type gene was 
determined (Song and Jackowski (1992) J. Bacterid; 174:641 1-6417 and Flamm et al 
(1988) Gene (Amst.) 74:555-558). Strains containing multiple copies of the coaA gene 
5 possessed 76-fold higher specific activity of pantothenate kinase, however, there was 
only a 2.7-fold increase in the steady state level of CoA (Song and Jackowski, supra). It 
has further been reported that the prokaryotic enzyme (encoded by coaA in E.coli and a 
variety pf other microorganisms) is feedback inhibited by . CoA both in vivo and in vitro 
with CoA being about five times more potent than.acetyl-CoA in inhibiting the enzyme 
10 (Song and Jackowski, supra and Vallari et al, supra). Moreover, it has been reported 
that the panB gene product in E. coli is inhibited by CoA (Powers and Snell (1976) J. 
Biol Chem. 251 :3786-3793). These data further support the view that feedback 
inhibition of pantothenate kinase activity is a critical factor controlling intracellular CoA 
concentration. 

15 Using standard search and alignment tools, coaA homologues have been 

identified in Hemophilus influenzae, Mycobacterium tuberculosis y Vibrio cholerae. 
Streptococcus pyogenes and Bacillus subtilis. By contrast, proteins with significant 
similarity could not be identified in eukaryotic cells including Saccharomyces cerevisiae 
or in mammalian expressed sequence tag (EST) databases. Using a genetic selection 

20 strategy, a cDNA encoding pantothenate kinase activity has recently been identified 
from Aspergillus nidulans (Calder et al (1999) J. Biol Chem. 274:2014-2020). The, 
eukaryotic pantothenate kinase gene (panK) has distinct primary structure and unique 
regulatory properties that clearly distinguish it from its prokaryotic counterpart. A 
mammalian pantothenate kinase gene (mpanKla) has also been isolated which encodes 

25 a protein having homology to the A. nidulans PanK protein and to the predicted gene 
product of GenBank™ Accession Number 927798 identified in the S. cerevisiae 
genome pocket al. (2000) J. Biol Chem. 275:1377-1383). 

Summary of the Invention 

30 The present invention is based, at least in part, on the discovery of key enzyme- 

encoding genes of the pantothenate biosynthetic pathway in Bacillus subtilis. In 
particular, the present inventors have identified the panE gene of B. subtilis. 
Overexpression or deregulation of the panE gene in B. subtilis results in enhanced 
production of the panE gene product, ketopantoate reductase, further resulting in 

35 increased production of pantothenate. Likewise, mutations in this gene reduce 
pantothenate production in B. subtilis >90%. The present inventors have further 
identified the presumptive panBCD operon in B. subtilis 9 overexpression or 
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deregulation of which results in increased pantothenate production. The present 
inventors have further demonstrated that overexpression or deregulation of the panD 
gene in B. subtilis (resulting in enhanced production of thfe panD gene product, 
aspartate-a-decarboxylase) further results in increased production of pantothenate, in 
5 particular, in combination with deregulation of genes encoding key enzymes of the 
isoleucine-valine (i/v) biosynthetifc pathway. 1 

Accordingly, the present invention features methods of producing pantothenate, 

! 1 ' r 1 , 1 i 

as well as other compounds of the pantothenate biosynthetic pathway (e.g., 
1 ketopantoate, pantoate and P-alanine), termed "panto-compounds" herein, using 

10 microorganisms in which the pantothenate biosynthetic pathway and/or isbteucine- 

t 

valine biosynthetic pathway has been manipulated such that pantothenate or other 
desired panto-compounds are produced. In one embodiment, the invention features a 
method of producing a panto-compound (e.g., pantothenate or pantoate) that involves 
culturing a microorganism which overexpresses the panE gene product, ketopantoate 

1 5 reductase, also referred to herein as a ketogantoate reductase-overexpressing or 
"KPAR-O" microorganism, under conditions such that the panto-compound (e.g., 
pantothenate or pantoate) is produced. In another embodiment, the present invention 
features a method of producing panto-compounds (e.g., pantothenate or pantoate) 
which includes culturing a microorganism which overexpresses at least one 

20 pantothenate biosynthetic enzyme (e.g. , at least one of the panB, panC or panD gene 
products), preferably in a KPAR-O microorganism, under conditions such that the 
panto-compound (e.g., pantothenate or pantoate) is produced. 

Yet another aspect of the invention features methods of producing panto- 
compounds which are independent of the need to feed precursors (e.g., p-alanine or 

25 aspartate and/or a-ketoisovalerate or valine). In one embodiment, the invention 

features a method of producing pantothenate in a manner independent of precursor feed 
that includes culturing an aspartate-a-decarboxylase-overexpressing (AaD-O) 
microorganism having a deregulated isoleucine-valine (ilv) pathway under conditions 
such that pantothenate is produced. In another embodiment, the invention features a 

30 method of producing pantothenate in a manner independent of precursor feed that 

includes culturing an AaD-O microorganism having a deregulated pantothenate (pan) 
pathway and a deregulated isoleucine-valine (i/v) pathway, under conditions such that 
pantothenate is produced. In another embodiment, the invention features a method of 
producing pantothenate in a manner independent of aspartate or P-alanine feed that 

35 includes culturing an AaD-O microorganism under conditions such that pantothenate is 
produced. In another embodiment, the invention features a method of producing 
pantothenate in a manner independent of valine or a-ketoisovalerate feed that includes 
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culturing a microorganism having a deregulated isoleucine-valine (/7v) biosynthetic 
pathway under conditions such that pantothenate is produced. In yet another 
embodiment, the invention features a high yield production method for producing 
pantothenate that includes culturing a manipulated microorganism under conditions 
5 such that pantothenate is produced at a significantly high. yield (e.g., at a level greater 
than 1 0 g/L, 20 g/L, 30 g/L or 40g/L). '. ' '> 

The methods of the present invention further feature microorganisms that 
overexpresses acetohydroxyacid synthetase or acetohydroxyacid isoriieroreductase (e.g., 
microorganisms transformed with a vector that includes an ilvBNC nucleic acid 

10 sequence), microorganisms that overexpresses dihydroxy acid dehydratase (e.g., 

'it 

microorganisms transforrhed with a vector that includes an ilvD nucleic acid sequence), 
microorganisms that overexpresses aspartate-a-decarboxylase (e.g., microorganisms 
transformed with a vector that includes a panD nucleic acid sequence), microorganisms 
having a deregulated isoleucine-valine (//v) biosynthetic pathway and microorganisms 

1 5 having a deregulated pantothenate biosynthetic pathway (e.g. , microorganisms that 
overexpress any of ketopantoate hydroxymethyltransferase, ketopantoate reductase, 
pantothenate synthetase and aspartate-a-decarboxylase, for example, microorganisms 
transformed with a vector comprising a panBCD nucleic acid sequence or a vector 
comprising a panEl nucleic acid sequence). In one embodiment, the recombinant 

20 microorganism is Gram positive (e.g., microorganisms belonging to the genus Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci or Streptomyces). In another embodiment, 
the recombinant microorganism is Gram negative. Particularly preferred is a Bacillus 
recombinant microorganism (e.g., Bacillus licheniformis, Bacillus amyloliquefaciens, 
Bacillus subtilis, Bacillus pumilus, Bacillus halodurans, and the like). Recombinant 

25 vectors that contain the genes encoding Bacillus pantothenate and/or isoleucine-valine 
biosynthetic enzymes (e.g., B. subtilis pantothenate and/or isoleucine-valine biosynthetic 
enzymes) are also described. 

Also featured are methods of producing p-alanine that include culturing an 
aspartate-a-decarboxylase-overexpressing (AaD-O) microorganism under conditions 

30 such that p-alanine is produced and methods of producing P-alanine that involve 

contacting a composition comprising aspartate with an isolated Bacillus aspartate-a- 
decarboxylase enzyme under conditions such that p-alanine is produced. 

The production methods of the present invention further can include recovering 
the panto-compound (e.g., pantothenate or pantoate). 

35 The present invention further features recombinant microorganisms (e.g., AaD- 

O microorganisms, microorganisms having a deregulated isoleucine-valine (ilv) 
pathway, microorganisms overexpressing at least one of ketopantoate 
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hydroxy methyl transferase (the panB gene product), pantothenate synthetase (the panC 
gene product), aspartate-ct-decarboxylase (the panD gene product), ketopantoate 1 
reductase (the panEl gene product) and microorganisms having a deregulated panBCD 
operon. Also featured are panB, panC, panD, panE 9 ilyB, ilvN, alsS 9 ilvC 9 and/or ilvD 
5 nucleic acid molecules, as well as vectors including such nucleic acid molecules and 
gene products encoded by such nucleic acid molecules. 

The methodology of the present invention further includes, for example in , 

' I * ' 

addition to overexpressing at least one pantothenate biosynthetic enzyme, deleting or 
t I ■ 

mutating a second pantothenate biosynthetic enzyme, said second pantothenate 

10 biosynthetic enzyme preferably being downstream of the desired product in the 

pantothenate biosynthetic pathway. For example, mutating panC, in addition to 

overexpressing the panE gene product, results in even further enhanced or increased 

production of pantoate. Accordingly, in one embodiment, the invention features a 

method of producing pantoate which includes culturing a microorganism which 

1 5 overexpresses the panE gene product and which has a deletion in the panC gene. In 

another embodiment, the invention features a method of producing pantoate which 

includes culturing a microorganism which overexpresses the panE gene product and/or 

panB gene product and which has a deletion in the panC gene. Other exemplary 

embodiments include a method of producing ketopantoate which includes culturing a 

20 microorganism which overexpresses the panB gene product and which has a deletion in 

the panE gene and a method of producing P-alanine which includes culturing a 

microorganism which overexpresses the panD gene product and which has a deletion in 

the panC gene. Also included are methods of producing panto-compounds which 

include overexpressing at least one valine biosynthetic enzyme in a microorganism 

25 which has at least one pantothenate biosynthetic enzyme deleted. 

The present invention is also based at least in part, on the identification and 

characterization of a previously unidentified microbial pantothenate kinase gene, coaX. 

CoaXwas first identified in Bacillus subtilis and corresponds to an open reading frame 

in a portion of the chromosomal DNA that includes the 5* end of the ftsH gene, and all 

30 of the yacB, yacQ yacD, cysK and pabB genes. The present inventors have 

demonstrated that the yacB open reading frame encodes a novel pantothenate kinase 

activity, the gene being unrelated by homology to any previously known pantothenate 

kinase gene. The gene has been renamed coaX, as it encodes the enzyme which 

catalyzes the first step in the pathway from pantothenate to CoaA. 

35 Accordingly, the present invention features new and improved methods of 

producing pantothenate and other key compounds of the pantothenate biosynthetic 

pathway (e.g., panto-compounds) utilizing microorganisms having modified 
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pantothenate kinase activity. In particular, the present invention features recombinant , 
microorganisms that contain the coaXgenc or that contain a mutant coaX gene, having 
reduced pantothenate kinase activity. In one embodiment, the invention features such 
recombinant microorganisms further having a deregulated pantothenate biosynthetic 
5 pathway. In another embodiment, the invention features such recombinant 

microorganisms further having a deregulated isoleucine-valine (/7v) pathway. In a 
1 preferred embodiment, the microorganisms belong to the genus Bacillus (e.g.,B. ■ 
subtilis). | 

The present invention also features recombinant microorganisms (e.g., 
1 0 microorganisms belonging to the genus Bacillus, for exampile, B. subtilis) that contain 
the coaA gene or that contain a mutant coaA gene, optionally including a coaX gene or 
mutant thereof, having reduced pantothenate kinase activity. In one embodiment, the 
invention features such recombinant microorganisms further having a deregulated 
pantothenate biosynthetic pathway or having a deregulated isoleucine-valine (//v) 
15 pathway. 1 

Also featured are vectors that contain isolated coaX or coaA genes as well as , 
mutant coaX and/or coaA genes. Isolated nucleic acid molecules that contain isolated 
coaX genes or mutant coaX genes are featured in addition to isolated CoaX proteins and 
mutant CoaX proteins. 

20 The nucleic acids, vectors and recombinant microorganisms described above are 

particularly useful in the methodologies of the present invention. In particular, the 
invention features methods of enhancing panto-compound production (e.g., 
ketopantoate, pantoate and or pantothenate production) that include culturing a 
recombinant microorganism having a mutant coaX gene under conditions such that 

25 panto-compound production is enhanced. In one embodiment, the recombinant 
microorganism further includes a mutant coaA gene. In another embodiment, the 
recombinant microorganism further includes a mutant avtA and/or mutant ilvE gene 
and/or mutant ansB gene and/or mutant alsD gene. Also featured are methods for 
identifying pantothenate modulators utilizing the recombinant microorganisms and 

30 purified CoaX proteins of the present invention. 

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 
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Brief Description of the Drawings 
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Figure 7 is a schematic representation of the pantothenate biosynthetic pathway. 

Figure 2 is a schematic representation of the plasmid pAN240, containing 
sequences ligated upstream of the P2tpanBCD cassette, equivalent to the integrated 
5 version in strain PA221. ( 

Figure 3 A is a schematic representation of the plasmid pAN004, containing the 
panBCD operon expressed from P26, and RBS1. 

Figure 3B is a schematic representation of the plasmid pAN006, containing the 
panBCD operon expressed from P26 and RBS2. 
10 Figure 4 is a schematic representation of the plasmid pAN236; containing an 

integratable and amplifiable P2d-RBS2-pa«£7 expression cassette. 

Figure 5 is a schematic representation of the construction of plasmid pAN423. 

Figure 6 is a schematic representation of the construction of plasmids pAN426 
andpAN427. 

15 Figure 7 is a schematic representation of the construction of plasmids pAN428 

and pAN429. 

Figure 8 is a schematic representation of the construction of plasmid pAN431 . 

Figure 9 is a schematic representation of the construction of plasmid pAN441. 

Figure 10 is a schematic representation of the construction of plasmid pAN440. 
20 Figure 77 is a schematic representation of the plasmid pAN251 designed to 

integrate a single copy of a P26~panEl cassette at the panEl locus by double crossover. 

Figure 72 is a schematic representation of the plasmid pAN267 designed to 
integrate a single copy of a P26~HvBNC cassette at the amyE locus. 

Figure 13 is a schematic representation of the plasmid pAN257 9 a clone of 
25 Bacillus subtilis ilvD in a low copy vector. 

Figure 14 is a schematic representation of the plasmid pAN263, designed to 
integrate a single copy of a P26-HvD cassette at the ilvD locus. 

Figure 15 is a schematic representation of the plasmid pAN261 9 designed to 
disrupt the Bacillus subtilis ilvD gene with the cat gene. 
30 Figure 16 is a schematic representation of the Coenzyme A biosynthetic pathway 

in E. coli. 

Figure 77 is a schematic representation of the structure of pAN296, a plasmid 
designed to delete most of the B. subtilis coaA gene and substitute a chloramphenicol 
resistance gene. 

35 Figure 18 is a schematic representation of the structure of the Bacillus subtilis 

genome in the region of the coaA gene. The scale is in base pairs and the significant 
open reading frames are shown by open arrows. 
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Figure J 9 is a schematic representation of the plasmid pAN281, a plasmid for . 
expressing Bacillus subtilis coaA after integratibn at the bpr locus. 

Figure 20A-B depicts a multiple sequence alignment (MSA) of the amino acid 
sequences encoded by six known or predicted microbial coaA genes. SEQ ID NOs:4-6 
5 and 1-3 correspond to the amino acid sequences of Mycobacterium leprae (SwissProt™ 
Accession No. Q9X795), Mycobacterium tuberculosis (SwissPrdtf™ Accession No. 
053440), Streptomyces coelicolor (SwissProt™ Accession No. 086799), Haemophilus 
influenzae (SwissProt™ Accession No. P44793), Escherichia coli SwissProt™ 
Accession No. PI 5044) and Bacillus subtilis (SwissProt™ Accession No. P54556), 

1 0 respectively. The alignment was generated using ClustalW MSA software at the 
GenomeNet CLUSTALW Server at the Institute for Chemical Research, Kyoto 
University. The following parameters were used: Pairwise Alignment, K T tuple (word) 
size — 1, Window size = 5, Gap Penalty = 3, Number of Top Diagonals = 5, Scoring 
Method = Percent; Multiple Alignment, Gap Open Penalty =10, Gap Extension Penalty 

15 = 0.0, Weight Transition = No, Hydrophilic residues = Gly, Pro, Ser, Asn, Asp, Gin, 
Glu, Arg and Lys, Hydrophobic Gaps = Yes; and Scoring Matrix = BLOSUM. 

Figure 21 is a schematic representation of the structure of the Bacillus subtilis 
genome in the region of the coaX (yacB) gene. The scale is in base pairs, the significant 
open reading frames are shown by open arrows and certain predicted restriction 

20 fragments are indicated by thick bars. 

Figure 22 is a schematic representation of the structure of pAN341 and pAN342, 
two independent PCR-derived clones of B. subtilis yacB (remaned herein as coaX). 

Figure 23A-D depicts a multiple sequence alignment (MSA) of the amino acid 
sequences encoded by fourteen known or predicted microbial coaX genes. SEQ ID 

25 NOs:9, 74, 7-8, 75, 11, 10 and 12-18 correspond to the amino acid sequences of Bacillus 
subtilis (SwissProt™ Accession No. P37564), Clostridium acetobulyticum (WIT™ 
Accession No. RCA03301, Argonne National Laboratories), Streptomyces coelicolor 
(PIR™ Accession No. T36391), Mycobacterium tuberculosis (SwissProt™ Accession 
No. 006282), Rhodobacter capsulatus (WIT™ Accession No. RRC02473), 

30 Desulfovibrio vulgaris (DBJ™ Accession No. BAA2 1476.1), Deinococcus radiodurans 
(SwissProt™ Accession No. Q9RX54), Thermotoga maritima (GenBank™ Accession 
No. AAD35964.1), Treponema pallidum (SwissProt™ Accession No. 083446), 
Borrelia burgdorferi (SwissProt™ Accession No.051477), Aquifex aeolicus 
(SwissProt™ Accession No. 067753), Synechocystis sp. (SwissProt™ Accession No. 

35 P74045), Helicobacter pylori (SwissProt™ Accession No. 025533), and Bordetella 
pertussis (SwissProt™ Accession No. Q45338), respectively. The alignment was 
generated using ClustalW MSA software at the GenomeNet CLUSTALW Server at the 
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Institute for Chemical Research, Kyoto University. The following parameters were 

used: Pairwise Alignment, K-tupIe (word) size = 1, Window size = 5, Gap Penalty = 3, 

Number of Top Diagonals -5, Scoring Method = Percent; Multiple Alignment, Gap 

Open Penalty =10, Gap Extension Penalty = 0.0, Weight Transition = No, Hydrophilic 

5 residues = Gly, Pro, Ser, Asn, Asp, Gin, Glu, Arg and Lys, Hydrophobic Gaps = Yes; 

and Scoring Matrix = BLOSUM. : . 

Figure 24 depicts a multiple sequence alignment of a portion of the pfotein 

i , ' 1 

sequences of the coaA gene products from the following microorganisms: Bacillus 
j * i I ' * 

subtilis, Escherichia coli, Haemophilus influenzae* Mycobacterium leprae, 

1 0 Mycobacterium tuberculosis, and Streptomyces coelicolor. The residues that are 

mutated in E. coli coaA15(Ts) and B. subtilis coaA282A are indicated below and above 
the alignment, respectively. The portions correspond to amino acid residues 168-187 of 
SEQIDNO:3, 167-186 of SEQ ID NO:2, 165-184 of SEQ ID NO:l, 169-188 ofSEQ 
IDNO:4, 169-188 of SEQ IDNO:5 and 179-198 of SEQ IDNO:6, respectively. 

15 Figure 25 is a schematic representation of the structure of pAN294, a plasmid 

for integrating mutagenized B. subtilis coaA at its native locus. 

Figure 26 is a schematic representation of the structure of pAN336, a plasmid 
designed to delete B. subtilis coaX from its chromosomal locus and replace it with a 
kanamycin resistence gene. 

20 

Detailed Description of the Invention 

The present invention features new and improved methods of producing 
pantothenate and other key compounds of the pantothenate biosynthetic pathway 
(referred to herein as "panto-compounds", for example, pantothenate, ketopantoate, 
25 pantoate and P-alanine) using microorganisms in which the pantothenate biosynthetic 
pathway has been manipulated such that pantothenate or other desired panto-compounds 
are produced. 

The new and improved methodologies of the present invention include methods 
of producing panto-compounds (e.g., pantothenate) in microorganisms having at least 

30 one enzyme of the pantothenate biosynthetic pathway manipulated such that 

pantothenate or other desired panto-compounds are produced (e.g. , produced at an 
increased level). For example, the invention features methods of producing panto- 
compounds (e.g., pantothenate) in microorganisms having at least one of ketopantoate 
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase or aspartate- 

35 a-decarboxylase manipulated such that pantothenate or other desired panto-compounds 
are produced. The methodologies of the present invention also include methods of 
producing panto-compounds (e.g., pantothenate) in microorganisms having at least one 
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yaline-isoleucine biosynthetic enzyme, described herein, manipulated such that 
pantothenate or other desired panto-compounds are produced. For example, the 
invention features methods of producing panto-compounds (e.g., pantothenate) in 
microorganisms having at least one of acetohydroxyacid synthetase, acetohydroxyacid 

isomeroreductase or dihydroxyacid dehydratase manipulated such that pantothenate or 

1 i 

other desired panto-compounds are produced. 

1 The invention also features methods of producing panto-compounds that involve 

culturing a ketopantoate reductase r overexpressing (KPAR-O) microorganism under 



conditions such that the panto-compound is produced. The invention also features 

I i i.< . 

10 methods of producing pantothenate in a manner independent of precursor feed that 

involve culturing an aspartate-a-decarboxylase-overexpressing (ActD-O) microorganism 
under conditions such that pantothenate is produced. Also featured are (3-alanine 
independent high yield pantothenate production methods as well as methods of 
producing P-alanine. The present invention also features methods for enhancing 

1 5 production of panto-compounds that involve culturing pantothenate kinase mutants. In 
particular, the present invention features new and improved methods of producing 
pantothenate and other key compounds of the pantothenate biosynthetic pathway (e.g. , 
panto-compounds) utilizing microorganisms having modified pantothenate kinase 
activity, for example, microorganisms that include the coaX gene or that include a 

20 mutant coaX gene, having reduced pantothenate kinase activity. 

In order that the present invention may be more readily understood, certain terms 
are first defined herein. 

The term "pantothenate biosynthetic pathway" includes the biosynthetic pathway 
involving pantothenate biosynthetic enzymes (e.g., polypeptides encoded by 

25 biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates, 

intermediates or products), cofactors and the like utilized in the formation or synthesis 
of pantothenate. The term "pantothenate biosynthetic pathway" includes the 
biosynthetic pathway leading to the synthesis of pantothenate in a microorganisms (e.g., 
in vivo) as well as the biosynthetic pathway leading to the synthesis of pantothenate in 

30 vitro. Figure 1 includes a schematic representation of the pantothenate biosynthetic 
pathway. Pantothenate biosynthetic enzymes are depicted in bold and their 
corresponding genes indicated in italics. 

The term "pantothenate biosynthetic enzyme" includes any enzyme utilized in 
the formation of a compound (e.g., intermediate or product) of the pantothenate 

35 biosynthetic pathway. According to Figure 1, synthesis of pantoate from a- 

ketoi so valerate (a-KIV) proceeds via the intermediate, ketopantoate. Formation of 
ketopantoate is catalyzed by the pantothenate biosynthetic enzyme ketopantoate 
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hydroxymethyltransferase (the panB gene product). Formation of pantoate is catalyzed 

. 1 i ( ■ < > i \ . i p . . . ' i 

by the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene 
product). Synthesis of p-alanine from aspartate is catalyzed by the pantothenate 

i 

i 

biosynthetic enzyme aspartate-a-decarboxylase (the panD gene product). Formation of 
5 pantothenate from pantoate and P-alanine (e.g., condensation) is catalyzed by the 
pantothenate biosynthetic enzyme pantothenate synthetase (the papC gene product). 

The term "isoleucine-valine biosynthetic pathway" includes the biosynthetic 
pathway involving isoleucine-valine biosynthetic enzymes (e.g., polypeptides encoded 
by biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates, 

10 intermediates or products), cofactors and the like utilized in the formation or synthesis 
of conversion of pyruvate to valine or isoleucine. The term "isoleucine-valine 
biosynthetic pathway" includes the biosynthetic pathway leading to the synthesis of 
valine or 1 isoleucine in a microorganisms (e.g., in vivo) as well as the biosynthetic 
pathway leading to the synthesis of valine or isoleucine in vitro. Figure 1 includes a 

15 schematic representation of the isoleucine-valine biosynthetic pathway. Isoleucine- 
valine biosynthetic enzymes are depicted in bold italics and their corresponding genes 
indicated in italics 

The term "isoleucine-valine biosynthetic enzyme" includes any enzyme utilized 
in the formation of a compound (e.g., intermediate or product) of the isoleucine-valine 

20 biosynthetic pathway. According to Figure 1, synthesis of valine from pyruvate 

proceeds via the intermediates, acetolactate, a,P-dihydroxyisovalerate (a,p-DHIV) and 
a-ketoisovalerate (a-KIV). Formation of acetolactate from pyruvate is catalyzed by the 
isoleucine-valine biosynthetic enzyme acetohydroxyacid synthetase (the ilvBN gene 
product, or alternatively, the alsS gene product). Formation of ct,p-DHI V from 

25 acetolactate is catalyzed by the isoleucine-valine biosynthetic enzyme 

acetohydroxyacidisomero reductase (the ilvC gene product). Synthesis of a-KIV from 
a,p-DHIV is catalyzed by the isoleucine-valine biosynthetic enzyme dihydroxyacid 
dehydratase (the ilvD gene product). Moreover, valine and isoleucine can be 
interconverted by branched chain amino acid transaminases. 

30 As used herein, each of ketopantoate, pantoate, p-alanine and pantothenate are 

"panto-compounds". The term "panto-compound" includes a compound (e.g., a 
substrate, intermediate or product) in the pantothenate biosynthetic pathway which is 
downstream from a particular pantothenate biosynthetic enzyme. In one example, a 
panto-compound is downstream of the pantothenate biosynthetic enzyme ketopantoate 

35 hydroxymethyltransferase (the panB gene product) and can include ketopantoate, 

pantoate and/or pantothenate. In another example, a panto-compound is downstream of 
the pantothenate biosynthetic enzyme ketopantoate reductase (the panE gene product) 
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and can include pantoate and/or pantothenate. In yet another example, a pantpr 
compound is downstream of the pantothenate biosynthetic enzyme pantothenate 
synthetase (the panC gene product) and can include pantothenate. In yet afiother 
example, a panto-compound is downstream of the pantothenate biosynthetic enzyme 
aspartate-a-decarboxylase (the panD gene product) and can include P-alariine and/or 
pantothenate. = 

Preferred panto-compounds include pantothenate and pantoate: The term t 
pantothenate" includes the free acid form of pantothenate, also referred to as 
pantothenic acid" as well as any salt thereof (e.g., derived by replacing the acidic 
1 0 hydrogen of pantothenate or pantothenic acid with a cation, for example, calcium, 
sodium, potassium, ammonium), also referred to as a "pantothenate salt". The term 
"panto-compound" also includes alcohol derivatives of pantothenate. Preferred 
pantothenate salts are calcium pantothenate or sodium pantothenate. A preferred 
alcohol derivative is pantothenol. [Pantothenate salts and/or alcohols of the present 
1 5 invention include salts and/or alcohols prepared via conventional methods from the free 
acids described herein. In another embodiment, calcium pantothenate is synthesized 
directly by a microorganism of the present invention. A pantothenate salt of the present 
invention can likewise be converted to a free acid form of pantothenate or pantothenic 
acid by conventional methodology. 
20 The term "pantoate" includes the free acid form of pantoate, also referred to as 

"pantoic acid" as well as any salt thereof (e.g., derived by replacing the acidic hydrogen 
of pantoate or pantoic acid with a cation, for example, calcium, sodium, potassium, 
ammonium), also referred to as a "pantoate salt". Preferred pantoate salts are calcium 
pantoate or sodium pantoate. Pantoate salts of the present invention include salts 
25 prepared via conventional methods from the free acids described herein. A pantoate salt 
of the present invention can likewise be converted to a free acid form of pantoate or 
pantoic acid by conventional methodology. Moreover, a free acid form of pantoate or 
pantoic acid can be converted to pantolactone by conventional methodology. 

The term "CoA biosynthetic pathway" includes the biosynthetic pathway 
30 involving CoA biosynthetic enzymes (e.g., polypeptides encoded by biosynthetic 
enzyme-encoding genes), compounds (e.g., precursors, substrates, intermediates or 
products), cofactors and the like utilized in the formation or synthesis of CoA from 
pantothenate. A schematic representation of the CoA biosynthetic pathway in E. coli is 
set forth as Figure 16. (The pathway depicted is also presumed to be that utilized by 
35 other microorganisms.) The term "CoA biosynthetic pathway" includes the biosynthetic 
pathway leading to the synthesis of CoA in microorganisms (e.g., in vivo) as well as the 
biosynthetic pathway leading to the synthesis of CoA in vitro. The term "Coenzyme A 
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or CoA biosynthetic enzyme" includes any enzyme utilized in the formation of a 
compound (e.g., intermediate or product) of the Co ft biosynthetic pathway, for example, 
the coaA,panK or coaX gene product which catalyzes the phosphorylation of 
pantothenate to form 4'-phosphopantothenate, or the coaD gene product which catalyzes 

5 the conversion of 4*-phosphopantetheine to dephosphocoenzyme A. 

■ I* , . ■ ' i 

L Recombinant Microorganisms and Methods for Culturing 

Microorganisms Such That Panto-Compounds are Produced 

The methodologies of the present invention feature microorganisms, e.g., 

10 recombinant microorganisms, preferably including vectors or genes (e.g., wild-type 
and/or mutated genes) as described herein and/or cultured in a manner which results in 
the production of a desired product (e.g. a panto-compound or panto-compounds). The 
term "recombinant" microorganism includes a microorganism (e.g., bacteria, yeast cell, 
fungal cell, etc.) which has been genetically altered, modified or engineered (e.g., 

1 5 genetically engineered) such that it exhibits an altered, modified or different genotype 
and/or phenotype (e.g., when the genetic modification affects coding nucleic acid, 
sequences of the microorganism) as compared to the naturally-occurring microorganism 
from which it was derived. Preferably, a "recombinant" microorganism of the present 
invention has been genetically engineered such that it overexpresses at least one 

20 bacterial gene or gene product (e.g., a pantothenate or isoleucine- valine biosynthetic 

enzyme encoding-gene) as described herein, preferably a biosynthetic enzyme encoding- 
gene included within a recombinant vector as described herein and/or a biosynthetic 
enzyme expressed from a recombinant vector. The ordinary skilled will appreciate that 
a microorganism expressing or overexpressing a gene product produces or overproduces 

25 the gene product as a result of expression or overexpression of nucleic acid sequences 
and/or genes encoding the gene product. 

The term "manipulated microorganism" includes a microorganism that has been 
engineered (e.g., genetically engineered) or modified such that the microorganism has at 
least one enzyme of the pantothenate biosynthetic pathway and/or at least one enzyme of 

30 the isoleucine-valine biosynthetic pathway modified such that pantothenate or other 
desired panto-compounds are produced. Modification or engineering of such 
microorganisms can be according to any methodology described herein including, but 
not limited to, deregulation of a biosynthetic pathway and/or overexpression of at least 
one biosynthetic enzyme. A "manipulated" enzyme (e.g., a "manipulated" biosynthetic 

35 enzyme) includes an enzyme, the expression or production of which has been altered or 
modified such that at least one upstream or downstream precursor, substrate or product 



4 



WO 01/21772 , PCT/US00/25993 

-16- 

• ■ > 

i ( ' i 

of the enzyme is altered or modified, for example, as compared to a corresponding wildr 
type or naturally occurring eirzyme. ■ i ' 

The term "oyerexpressed" or "overexpression" includes expression of a gene 
product (e.g., a pantothenate biosynthetic enzyme or isoleucine- valine biosynthetic 
5 enzyme) at a level greater than that expressed prior to. manipulation of the 

microorganism or in a comparable microorganism wliich has not been manipulated. In 
one embodiment, the microorganism cap be genetically manipulated (e.g., genetically 
engineered) to overexjpress a level of gene product greater than that expressed prior to 
manipulation of the microorganism or in a comparable microorganism which has not 

10 been manipulated. Genetic manipulation can include, but is not limited to, altering or 
modifying regulatory sequences or sites associated with expression of a particular gene 
(e.g., by adding strong promoters, inducible promoters or multiple promoters or by 
removing regulatory sequences such that expression is constitutive), modifying the 
chromosomal location of a particular gene, altering nucleic acid sequences adjacent to a 

15 particular gene such as a ribosome binding site or transcription terminator, increasing 

i 

the copy number of a particular gene, modifying proteins (e.g., regulatory proteins; 
suppressors, enhancers, transcriptional activators and the like) involved in transcription 1 
of a particular gene and/or translation of a particular gene product, or any other 
conventional means of deregulating expression of a particular gene routine in the art 

20 (including but not limited to use of antisense nucleic acid molecules, for example, to 
block expression of repressor proteins). 

In another embodiment, the microorganism can be physically or environmentally 
manipulated to overexpress a level of gene product greater than that expressed prior to 
manipulation of the microorganism or in a comparable microorganism which has not 

25 been manipulated. For example, a microorganism can be treated with or cultured in the 
presence of an agent known or suspected to increase transcription of a particular gene 
and/or translation of a particular gene product such that transcription and/or translation 
are enhanced or increased. Alternatively, a microorganism can be cultured at a 
temperature selected to increase transcription of a particular gene and/or translation of a 

30 particular gene product such that transcription and/or translation are enhanced or 
increased. 

The term "deregulated" or "deregulation" includes the alteration or modification 
of at least one gene in a microorganism that encodes an enzyme in a biosynthetic 
pathway, such that the level or activity of the biosynthetic enzyme in the microorganism 
35 is altered or modified. Preferably, at least one gene that encodes an enzyme in a 

biosynthetic pathway is altered or modified such that the gene product is enhanced or 
increased. The phrase "deregulated pathway" can also include a biosynthetic pathway in 
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which more than one gene that encodes an enzyme in a biosynthetic pathway is altered 
or modified such that the level or activity of more than one biosynthetic .enzyme i!s 
altered or modified. The ability to "deregulate" a pathway (e.g., to simultaneously 
deregulate more than one gene in a given biosynthetic pathway) in a microorganism 
5 arises from the particular phenomenon of microorganisms in which more than one 
enzyme (e.g., two or three biosynthetic enzymes) are encoded by genes occurring 

adjacent to one another on a contiguous piece of genetic material termed an "operon". 

' ' j i." I 1 

The term "operon" includes a coordinated unit of gene expression that contains a 

1 promoter and possibly a regulatory element associated with one or more, preferably at 

10 least two, structural genes (e.g., genes encoding enzymes, for example, biosynthetic 
enzymes). Expression of the structural genes can be coordinately regulated, for 
example, by regulatory proteins binding to the regulatory element or by anti-termination 
of transcription. The structural genes can be transcribed to give a single mRNA that 
encodes all of the structural proteins. Due to the coordinated regulation of genes 

15 included in an operon, alteration or modification of the single promoter and/oi; 

regulatory element can result in alteration or modification of each gene product encoded 
by the operon. Alteration or modification of the regulatory element can include, but is 
not limited to removing the endogenous promoter and/or regulatory element(s), adding 
strong promoters, inducible promoters or multiple promoters or removing regulatory 

20 sequences such that expression of the gene products is modified, modifying the 

chromosomal location of the operon, altering nucleic acid sequences adjacent to the 
operon or within the operon such as a ribosome binding site, increasing the copy number 
of the operon, modifying proteins (e.g., regulatory proteins, suppressors, enhancers, 
transcriptional activators and the like) involved in transcription of the operon and/or 

25 translation of the gene products of the operon, or any other conventional means of 

deregulating expression of genes routine in the art (including but not limited to use of 
antisense nucleic acid molecules, for example, to block expression of repressor 
proteins). Deregulation can also involve altering the coding region of one or more genes 
to yield, for example, an enzyme that is feedback resistant or has a higher or lower 

30 specific activity. 

A particularly preferred "recombinant" microorganism of the present invention 
has been genetically engineered to overexpress a bacterially-derived gene or gene 
product. The term "bacterially-derived" or "derived-from", for example bacteria, 
includes a gene which is naturally found in bacteria or a gene product (e.g., ketopantoate 

35 hydroxy methyltransferase, ketopantoate reductase, pantothenate synthetase, aspartate-a- 
decarboxylate, acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase or 
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(dihydroxyacid dehydratase) which is encoded by a bacterial gene (e.g., encoded by 
panB, panE, panC, panD, ilvB, ilvN, alsS, ilvC, or ilvD). ' ( 

The methodologies of the present invention feature recombinant microorganisms 
which overexpress at least one of ketopantoate hydroxymethyltransferase, ketopantoate 
5 reductase, pantothenate synthetase or aspartate-a-decarboxylase. A particularly 
preferred recombinant microorganism of the present invention has been genetically 

engineered to overexpress a Bacillus (e.g., Bacillus lie fieniformis, Bacillus ' i 

I * 

amyloliquefaciens, Bacillus halodurans, Bacillus subtilis, and Bacillus pumilus, etc.) 
biosynthetic enzyme (e.g., has been engineered to overexpress at least one of B. subtilis 
10 ketopantoate reductase (the panE gene product) (e.g., ketopantoate reductase having the 
amino acid sequence of SEQ ID NO:30 or encoded by the nucleic acid sequence of SEQ 
ID NO:29), B. subtilis ketopantoate hydroxymethyltransferase (the panB gene product) 

j 

(e.g., ketopantoate hydroxymethyltransferase having the amino acid sequence of SEQ 
ID NO:24 or encoded by a nucleic acid molecule having the nucleotide sequence of 

15 SEQ ID NO:23), B. subtilis pantothenate synthetase (the panC gene product) (e.g., 

pantothenate synthetase having the amino acid sequence of SEQ ID NO:26 or encoded 
by a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:25) and/or B. 
subtilis aspartate-a-decarboxylase (the panD gene product) (e.g., aspartate-a- 
decarboxylase having the amino acid sequence of SEQ ID NO:28 or encoded by a 

20 nucleic acid molecule having the nucleotide sequence of SEQ ID NO:27). 

In an exemplary embodiment, the invention features a microorganism (e.g., a 
KPAR-O microorganism) that has been transformed with a vector comprising a panE 
nucleic acid sequence (e.g., a panE nucleic acid seqeunce as set forth in SEQ ID 
NO:29). In another embodiment, the invention features a microorganism that has been 

25 transformed with a vector comprising a panB nucleic acid sequence (e.g., a panB nucleic 
acid sequence as set forth in SEQ ID NO:23), a vector comprising a panC nucleic acid 
sequence (e.g., a panC nucleic acid sequence as set forth in SEQ ID NO:25) or a vector 
comprising a panD nucleic acid sequence (e.g., a panD nucleic acid sequence as set 
forth in SEQ ID NO:27). In yet another embodiment, the invention features a 

30 microorganism having a deregulated panBCD operon (e.g., SEQ ID NO:59). 

Other preferred "recombinant" microorganisms of the present invention have a 
deregulated^isoleucine-valine (z?v) pathway. The phrase "microorganism having a 
deregulated isoleucine-valine (/7v) pathway" includes a microorganism having an 
alteration or modification in at least one gene encoding an enzyme of the isoleucine- 

35 valine (z/v) pathway or having an alteration or modification in an operon including more 
than one gene encoding an enzyme of the isoleucine-valine (z/v) pathway. A preferred 
"microorganism having a deregulated isoleucine-valine (z7v) pathway" has been 
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genetically engineered to overexpress a Bacillus (e.g., B. subtilis) ilv biosynthetic 
enzyme {e.g., has been engineered to overexpress at least one of acetohydroxyacid 
synthetase (the ilvBN gene products or the alsS gene product) (e.g., acetohydroxyacid 
synthetase having subunits having the amino acid sequences of SEQ ID NO:32 and SEQ 
5 ID NO:34 or encoded by pucleic acid molecules having the nucleotide sequence of SEQ 
ID NO:31 and SE;Q ID NO:33 or the nucleotide sequence of SEQ }D NO:58 from 
nucleotides 1-2246 or acetohydroxyacid synthetase encoded by a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:86), acetohydroxyacid isomeroreductase 
(the ilvC gene product) (e.g., acetohydroxyacid isomeroreductase having the amino acid 

10 sequence of SEQ ID NO:36 or encoded by a nucleic acid molecule having the 

nucleotide sequence of SEQ ID NO:35), dihydroxyacid dehydratase (the ilvD gene 
product) (e.g., dihydroxyacid dehydratase having the amino acid sequence of SEQ ID 
NO:38 oi* encoded by a nucleic acid molecule having the nucleotide sequence of SEQ 
ID NO:37), and/or has been transformed with a vector comprising an ilvBNC nucleic 

15 acid sequence (SEQ ID NO:58, coding regions from nucleotides 1-1725, 1722-2246 and 
2263-3291) and/or an ilvD nucleic acid sequence (SEQ ID NO:37). 

In another preferred embodiment, a recombinant microorganism is designed or 
engineered such that a mutant CoaA and/or CoaX biosynthetic enzyme is expressed and 
at least one pantothenate biosynthetic enzyme and/or at least one isoleucine-valine 

20 biosynthetic enzyme is overexpressed or deregulated. 

In another preferred embodiment, a microorganism of the present invention 
overexpresses or is mutated for a gene or biosynthetic enzyme (e.g., a CoA biosynthetic 
enzyme, pantothenate biosynthetic enzyme or isoleucine-valine biosynthetic enzyme) 
which is bacterially-derived. The term "bacterially-derived" or "derived-from", for 

25 example bacteria, includes a gene product (e.g., ketopantoate hydroxymethyltransferase, 
ketopantoate reductase, pantothenate synthetase, aspartate-a-decarboxylate, 
acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase, dihydroxyacid 
dehydratase or pantothenate kinase) which is encoded by a bacterial gene (e.g.,panB, 
panE, panC, panD, ilvBN (or alsS), ilvC, ilvD, or encoded by coaA or coaX). 

30 Still other preferred recombinant microorganisms of the present invention are 

mutant microorganisms. As used herein, the term "mutant microorganism" includes a 
recombinant microorganism that has been genetically engineered to express a mutated 
gene or protein that is normally or naturally expressed by the microorganism. 
Preferably, a mutant microorganism expresses a mutated gene or protein such that the 

35 microorganism exhibits an altered, modified or different phenotype (e.g., has been 

engineered to express a mutated CoaA biosynthetic enzyme, for example, pantothenate 
kinase). In one embodiment, a mutant microorganism is designed or engineered such 
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that it includes a mutant coaX gene, as defined herein. Jn another embodiment, a 

recombinant microorganism is designed or engineered such that it includes a mutant 

i 1 t 

coaA gene, as defined herein. In another embodiment, a mutant microorganism is 
designed or engineered such that a coaX gene has been deleted (i.e., the protein encoded 

t | i * 

5 by the coaX gene is not produced). In another embodiment, a mutant microorganism 1 is 
designed or engineered such that a coaA gene has been deleted (i.e., the protein encoded 
by the coaA gene is not produced). Preferably, a mutant microorganism has a mutant 

coaX gene or a mutant coaA gene, or has been engineered to have a coaX gene and/or 

i 1 

coaA deleted, such that that the mutant microorganism encodes a "reduced pantothenate 
10 kinase activity". In the context of a whole microorganism, a "reduced pantothenate 
kinase activity" can be (determined by measuring or assaying for a decrease in an 
intermediate or product of the CoA biosynthetic pathway, for example, measuring or 
assaying for 4'-phosphopantothenate, 4' -phosphopantothenyl cysteine, 4'- 
phosphopantetheine, dephosphocoenzyme A, Coenzyme A, apo-acyl carrier protein 
15 (apo-ACP) or holo-acyl carrier protein (ACP) in the microorganism (e.g., in a lysate 
isolated or derived from the microorganism) or in the medium in which the 

i 

microorganism is cultured (see e.g., Figure 16). Alternatively, a "reduced pantothenate 
kinase activity" can be determined by measuring or assaying for decreased growth of the 
microorganism. Alternatively, a "reduced pantothenate kinase activity" can be 

20 determined by measuring or assaying for an increase in a panto-compound (e.g., 
pantothenate) in the microorganism or surrounding media, as panto-compounds lie 
upstream of the CoA biosynthetic pathway, the first step of which is catalyzed by 
pantothenate kinase. The invention also features recombinant microorganisms that, in 
addition to having reduced pantothenate kinase activity (e.g., expressing mutant coaA 

25 and/or mutant coaX genes) have a deregulated pantothenate biosynthesis pathway and/or 
a deregulated isoleucine-valine (ilv) biosynthetic pathway. 

In one embodiment, a recombinant microorganism of the present invention is a 
Gram positive organism (e.g., a microorganism which retains basic dye, for example, 
crystal violet, due to the presence of a Gram-positive wall surrounding the 

30 microorganism). In a preferred embodiment, the recombinant microorganism is a 
microorganism belonging to a genus selected from the group consisting of Bacillus, 
Cornyebacterium, Lactobacillus, Lactococci and Streptomyces. In a more preferred 
embodiment, the recombinant microorganism is of the genus Bacillus. In another 
preferred embodiment, the recombinant microorganism is selected from the group 

35 consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, Bacillus firmus, 

Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus circulans, 
Bacillus coagulans, Bacillus licheniformis, Bacillus megaterium, Bacillus pumilus, 
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Bacillus thuringiensis, and other Group 1 Bacillus species, for example, as characterized 
by 16S rRNA type (Priest (1993) in Bacillus subtilis and Other Gram-Positive Bacteria 
eds. Sonenshein et al. 9 ASM, Washington, D.C., p. 6). Inanother preferred 

j 

embodiment, the recombinant microorganism is Bacillus brevis or Bacillus 
5 stearothermophilus. In another preferred embodiment, the recombinant microorganism 
is selected from the group consisting of Bacillus lictieniformis, Bacillus 

amyloliquefaciens, Bacillus halodurans, Bacillus subtilis \ and Bacillus pumilus. , 

I ' 

In another embodiment, the recombinant microorganism is a Gram negative 

• ' 1 i '.' 1 

(excludes basic dye) organism. In a preferred embodiment, the recombinant 
10 microorganism is a microorganism belonging to a genus selected from the group 

i 

consisting of Salmonella, Escherichia, Klebsiella, Serratia, and Proteus. In a more 
preferred embodiment, the recombinant microorganism is of the genus Escherichia, In 
an even more preferred embodiment, the recombinant microorganism is Escherichia 
coli. In another embodiment, the recombinant microorganism is Saccharomyces (e.g., S. 
15 cerevisiae). 

An important aspect of the present invention involves culturing the recombinant 
microorganisms described herein, such that a desired compound (e.g., a desired panto- 
compound) is produced. The term "culturing" includes maintaining and/or growing a 
living microorganism of the present invention (e.g., maintaining and/or growing a 

20 culture or strain). In one embodiment, a microorganism of the invention is cultured in 
liquid media. In another embodiment, a microorganism of the invention is cultured in 
solid media or semi-solid media. In a preferred embodiment, a microorganism of the 
invention is cultured in media (e.g., a sterile, liquid media) comprising nutrients 
essential or beneficial to the maintenance and/or growth of the microorganism (e.g., 

25 carbon sources or carbon substrate, for example complex carbohydrates such as bean or 
grain meal, starches, sugars, sugar alcohols, hydrocarbons, oils, fats, fatty acids, organic 
acids and alcohols; nitrogen sources, for example, vegetable proteins, peptones, peptides 
and amino acids derived from grains, beans and tubers, proteins, peptides and amino 
acids derived form animal sources such as meat, milk and animal byproducts such as 

30 peptones, meat extracts and casein hydrolysates; inorganic nitrogen sources such as 
urea, ammonium sulfate, ammonium chloride, ammonium nitrate and ammonium 
phosphate; phosphorus sources, for example, phosphoric acid, sodium and potassium 
salts thereof; trace elements, for example, magnesium, iron, manganese, calcium, 
copper, zinc, boron, molybdenum, and/or cobalt salts; as well as growth factors such as 

35 amino acids, vitamins, growth promoters and the like). 
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Preferably,i microorganisms of thepresent invention are cultured under . 
controlled pH. The term "controlled pH" includes any pH which results in production of 
the desired product (e.g., a panto-compound). In one embodiment, microorganisms are 
cultured at a pH of about 7. In another embodiment, microorganisms are cultured at a 
5 pH of between 6.0 and 8.5 1 . The desired pH may be maintained by any number of 
methods known to those skilled in the art. > 

Also preferably, microorganisms of the present invention are cultured under 
controlled aeration. The term "controlled aeration" includes sufficient aeration (e.g., 

oxygen) to result in production of the desired product (e.g., panto-compound). In one 

i 1 

10 embodiment, aeration is controlled by regulating oxygen levels in the culture, for 

example, by regulating the amount of oxygen dissolved in culture media. Preferably, 

. -i 

aeration of the culture is controlled by agitating the culture. Agitation may be provided 

■ i 

by a propeller or similar mechanical agitation equipment, by revolving or shaking the 
growth vessel (e.g., fermentor) or by various pumping equipment. Aeration may be 

1 5 further controlled by the passage of sterile air or oxygen through the medium (e.g., 
through the fermentation mixture). Also preferably, microorganisms of the present 
invention are cultured without excess foaming (e.g., via addition of antifoaming agents). ! 

Moreover, microorganisms of the present invention can be cultured under 
controlled temperatures. The term "controlled temperature" includes any temperature 

20 which results in production of the desired product (e.g., a panto-compound). In one 

embodiment, controlled temperatures include temperatures between 15°C and 95°C. In 
another embodiment, controlled temperatures include temperatures between 15°C and 
70°C. Preferred temperatures are between 20°C and 55°C, more preferably between 
30°C and 45°C or between 30°C and 50°C. 

25 Microorganisms can be cultured (e.g., maintained and/or grown) in liquid media 

and preferably are cultured, either continuously or intermittently, by conventional 
culturing methods such as standing culture, test tube culture, shaking culture (e.g., rotary 
shaking culture, shake flask culture, etc.), aeration spinner culture, or fermentation. In a 
preferred embodiment, the microorganisms are cultured in shake flasks. In a more 

30 preferred embodiment, the microorganisms are cultured in a fermentor (e.g., a 

fermentation process). Fermentation processes of the present invention include, but are 
not limited to, batch, fed-batch and continuous methods of fermentation. The phrase 
"batch process" or "batch fermentation" refers to a closed system in which the 
composition of media, nutrients, supplemental additives and the like is set at the 

35 beginning of the fermentation and not subject to alteration during the fermentation, 

however, attempts may be made to control such factors as pH and oxygen concentration 
to prevent excess media acidification and/or microorganism death. The phrase "fed- 
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batch process" or "fed-batch" fermentation refers to a batch fermentation with the 
exception that one or more substrates Or supplements are added (e.g., added in 1 
increments or continuously) as the fermentation progresses. The phrase "continuous 
process" or "continuous fermentation" refers to a system in which a defined 
5 fermentation media is added continuously to a fermentor and an equal amount of used or 
"conditioned" media is simultaneously removed, preferably for recovery of the desired 
product (e.g., panto-compound). X variety of such processes have been developed and 
are well-known in the art. 

The phrase "culturing under conditions such that a desired compound (e.g., a 
10 panto-compound, for example, pantothenate) is produced" includes maintaining and/or 
growing microorganisms under conditions (e.g., temperature, pressure, pH, duration, 
etc.) appropriate or sufficient to obtain production of the desired compound or to obtain 
desired yields of the particular compound being produced. For example, culturing is 

continued for a time sufficient to produce the desired amount of a panto-compound (e.g., 

j i 

15 pantothenate, pantoate or [3-alanine). Preferably, culturing is continued for a time 
sufficient to substantially reach maximal production of the panto-compound. In one 
embodiment, culturing is continued for about 12 to 24 hours. In another embodiment, 
culturing is continued for about 24 to 36 hours, 36 to 48 hours, 48 to 72 hours, 72 to 96 
hours, 96 to 120 hours, 120 to 144 hours, or greater than 144 hours. In another 

20 embodiment, culturing is continued for a time sufficient to reach production yields of 
panto-compound, for example, cells are cultured such that at least about 15 to 20 g/L of 
panto-compound are produced, at least about 20 to 25 g/L panto-compound are 
produced, at least about 25 to 30 g/L panto-compound are produced, at least about 30 to 
35 g/L panto-compound are produced, at least about 35 to 40 g/L panto-compound are 

25 produced (e.g., at least about 37 g/L panto-compound) or at least about 40 to 50 g/L 
panto compound are produced. In yet another embodiment, microorganisms are 
cultured under conditions such that a preferred yield of panto-compound, for example, a 
yield within a range set forth above, is produced in about 24 hours, in about 36 hours, in 
about 48 hours, in about 72 hours, or in about 96 hours. 

30 The methodology of the present invention can further include a step of 

recovering a desired compound (e.g., a panto-compound). The term "recovering" a 
desired compound (e.g., a panto-compound) includes extracting, harvesting, isolating or 
purifying the compound from culture media. Recovering the compound can be 
performed according to any conventional isolation or purification methodology known 

35 in the art including, but not limited to, treatment with a conventional resin (e.g., anion or 
cation exchange resin, non-ionic adsorption resin, etc.), treatment with a conventional 
adsorbent (e.g., activated charcoal, silicic acid, silica gel, cellulose, alumina, etc.), 
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alteration of pH, solvent extraction (e.g., with a conventional solvent such as an alcohol, 
ethyl acetate, hexane and the like), dialysis, filtration, concentration, crystallization, 
recrystallization, pH adjustment, lyophilization and the like. For example, a compound 
(e.g., a panto-compound) can be recovered from culture media by first removing the 
5 microorganisms from the culture. Media is then passed through or over, a cation 

exchange resin to remove unwanted cations and then through or. over an anion exchange 
resin to remove unwanted inorganic anions and organic acids having stronger acidities 
than the panto-compound of interest (e.g., pantothenate). The resulting panto-compound 
(e.g., pantothenate) can subsequently be converted to a pantothenate salt (e.g., calcium 
10 pantothenate) as described herein.! 

Preferably, a desired compound of the present invention's "extracted", 
"isolated" or "purified" such that the resulting preparation is substantially free of other 
components (e.g., free of media components and/or fermentation byproducts). The 
language "substantially free of other components" includes preparations of desired 

i 1 

15 compound in which the compound is separated (e.g., purified or partially purified) from 
media components or fermentation byproducts of the culture from which it is produced. 
In one embodiment, the preparation has greater than about 80% (by dry weight) of the 
desired compound (e.g., less than about 20% of other media components or fermentation 
byproducts), more preferably greater than about 90% of the desired compound (e.g., less 

20 than about 10% of other media components or fermentation byproducts), still more 

preferably greater than about 95% of the desired compound (e.g., less than about 5% of 
other media components or fermentation byproducts), and most preferably greater than 
about 98-99% desired compound (e.g., less than about 1-2% other media components or 
fermentation byproducts). When the desired compound is a panto-compound that has 

25 been derivatized to a salt (e.g. a pantothenate salt or pantoate salt), the panto-compound 
is preferably further free (e.g., substantially free) of chemical contaminants associated 
with the formation of the salt. When the desired compound is a panto-compound that 
has been derivatized to an alcohol, the panto-compound is preferably further free (e.g., 
substantially free) of chemical contaminants associated with the formation of the 

30 alcohol. 

In an alternative embodiment, the desired panto-compound is not purified from 
the microorganism, for example, when the microorganism is biologically non-hazardous 
(e.g., safe). For example, the entire culture (or culture supernatant) can be used as a 
source of product (e.g., crude product). In one embodiment, the culture (or culture 
35 supernatant) supernatant is used without modification. In another embodiment, the 

culture (or culture supernatant) is concentrated. In yet another embodiment, the culture 
(or culture supernatant) is dried or lyophilized. 
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// Panto-Compound Production Methodologies Featuring Ketopantoate 

Reductase-Overexpressing Microorganisms , 1 

1 1 • i i 

One aspect of the invention features methods of producing a panto-compounds 
that involve culturing a ketopantoate reductase-overexpressing (KPAR-O) 
5 microorganism under conditions such that the panto-compound is produced. The term 
"ketopantoate reductase-overexpressing (KPAR-O) microorganism" includes a 
microorganism which has been manipulated such that ketopantoate reductase is 
overexpressed {e.g. , a B. subtilis ketopantoate reductase protein having the amino acid 
sequence of SEQ ID NO:30) and/or has been transformed with a vector comprising a 
10 panEl nucleic acid sequence (e.g., a B. subtilis panEl nucleic acid sequence as set forth 

i 

in SEQ ID NO:29). In one embodiment, the panto-compound is pantothenate. In 
another embodiment, the panto-compound is pantoate. In another embodiment, the 
ketopantoate reductase is bacterial-derived. In another emtiodiemnt, the ketopantoate 
reductase is derived from Bacillus (e.g., is derived from Bacillus subtilis). In yet 

1 5 another embodiment, the KPAR-O microorganism is Gram positive. In yet another 
embodiment, the KPAR-O microorganism is a microorganism belonging to a genus 
selected from the group consisting of Bacillus, Cornyebacterium, Lactobacillus, 
Lactococci and Streptomyces. In a preferred embodiemnt, the KPAR-O microorganism 
is of the genus Bacillus. In a more preferred embodiment, the KPAR-O microorganism 

20 is selected from the group consisting of Bacillus licheniformis, Bacillus 

amyloliquefaciens, Bacillus halodurans, Bacillus subtilis and Bacillus pumilus. In a 
particularly preferred embodiemnt, the KPAR-O microorganism is Bacillus subtilis. 

In still other embodiments, the KPAR-O microorganism further overexpresses at 
least one pantothenate biosynthetic enzyme in addition to ketopantoate reductase. In an 

25 exemplary embodiment, the KPAR-O microorganism further overexpresses at least one 
of ketopantoate hydroxymethyltransferase, pantothenate synthetase and aspartate-a- 
decarboxylase. Also featured are methods of producing panto-compounds, for example, 
methods that involve culturing a KPAR-O microorganism, which further include the 
step of recovering the panto-compound. 

30 

III Methods of Producing Panto-Compounds Independent of Precursor Feed 
Requirements 

Depending on the biosynthetic enzyme or combination of biosynthetic enzymes 
manipulated, it may be desirable or necessary to provide (e.g., feed) microorganisms of 
35 the present invention at least one pantothenate biosynthetic precursor such that 

pantothenate or other desired panto-compounds are produced. The term "pantothenate 
biosynthetic precursor" or "precursor" includes an agent or compound which, when 
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provided to, brought into contact with, or, included in the culture medium ofa . 
microorganism, serves to enhance or increase pantothenate biosynthesis. In one 
embodiment, the pantothenate biosynthetic precursor or precursor is aspartate. In 
another embodiment, the pantothenate biosynthetic precursor or precursor is p-alanine. 

5 The amount of aspartate or p-alanine added is preferably an amount that results in a 

' • *•'.■''<•"• 

concentration in the culture medium sufficient to enhance productivity of the 

• '• ■ . ' ' 
microorganism (e.g., a concentration sufficient to enhaijce production ofa panto- 

1 ' 1 ' j 1 

compound, for example, P-alanine, ketopantdate, pantoate or pantothenate). 

Pantothenate biosynthetic precursors of the present invention can be added in the form 

10 ofa concentrated solution or suspension {e.g., in a suitable solvent such as water or 

buffer) or in the form of a solid (e.g., in the form of a powder). Moreover, pantothenate 
biosynthetic precursors of the present invention can be added as a single aliquot, 
continuously or intermittently over a given period of time. 

In yet another embodiment, the pantothenate biosynthetic precursor is valine, see 

1 5 e.g., Example III: In yet another embodiment, the pantothenate biosynthetic precursor is 
ot-ketoisovalerate. Preferably, valine or a-ketoisovalerate is added in an amount that 
results in a concentration in the medium sufficient for production of the desired product 
(e.g., panto-compound) to occur. Pantothenate biosynthetic precursors are also referred 
to herein as "supplemental pantothenate biosynthetic substrates". 

20 Providing pantothenate biosynthetic precursors in the pantothenate biosynthetic 

methodologies of the present invention, can be associated with high costs, for example, 
when the methodologies are used to produce high yields of panto-compounds. 
Accordingly, preferred methodologies of the present invention feature microorganisms 
having at least one biosynthetic enzyme or combination of biosynthetic enzymes (e.g., at 

25 least one pantothenate biosynthetic enzyme and/or valine-isoleucine biosynthetic 
enzyme) manipulated such that pantothenate or other desired panto-compounds are 
produced in a manner independent of precursor feed. The phrase "a manner 
independent of precursor feed", for example, when referring to a method for producing a 
desired compound (e.g., a panto-compound), includes an approach to or a mode of 

30 producing the desired compound that does not depend or rely on precursors being 
provided (e.g., fed) to the microorganism being utilized to produce the desired 
compound. For example, microorganisms featured in the methodologies of the present 
invention can be used to produce panto-compounds in a manner requiring no feeding of 
the precursors aspartate, P-alanine, valine and/or a-KIV. 

35 Alternative preferred methodologies of the present invention feature 

microorganisms having at least one biosynthetic enzyme or combination of biosynthetic 
enzymes manipulated such that pantothenate or other desired panto-compounds are 
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produced in a manner substantially independent of precursor feed. The phrase "a 
manner substantially independent of precursor feed" includes an approach to or a 1 
method of producing the desired compound that depends pr relies to a lesser extent on 
precursors being provided (e.g., fed) to the microorganism being utilized. For example, 
5 microorganisms featured in the methodologies of the present invention can be used to 
produce panto-compounds in a manner requiring feeding of substantially reduced 
amounts of the precursors aspartate, p-alanine, valine and/or a-KTV. Ipone ' - 
embodiment, the invention features methods of producing panto-compounds (e.g., 
pantothenate) in a manner that requires feeding of less than 5%-10% of the amount of 

10 precursor required by a control microorganism (e.g., a microorganism that is dependent, 
for example is wholly dependent, on precursor feed to efficiently produce the desired 
compound). In another embodiment, the invention features methods of producing 
panto-compounds in a manner that requires feeding of less than 15-20% of the amount 
of precursor required by a control microorganism. In another embodiment, the 

15 invention features methods of producing panto-compounds in a manner that requires 
feeding of less than 25-30%, 35-40%, 45-50% or 55-60% of the amount of precursor 
required by a control microorganism. As described in Examples I-III herein, particular 
microorganisms featured in the methodologies of the present invention require, for 
example, 5 g/L of aspartate, P-alanine, valine or a-KIV (e.g., in test tube or in shake 

20 flask cultures). Accordingly, in a preferred embodiment, the present invention features 
methods of producing panto-compounds (e.g., pantothenate) in a manner requiring 
feeding of less than 0.25 g/L, 0.5 g/L, 0.75 g/L, 1 g/L, 1.25 g/L, 1.5 g/L, 1.75 g/L, 2 g/L, 
2.25 g/L, 2.5 g/L, 2.75 g/L or 3 g/L. 

Preferred methods of producing desired compounds (e.g., panto-compounds) in a 

25 manner independent of precursor feed or alternatively, in a manner substantially 
independent of precursor feed, involve culturing microorganisms which have been 
manipulated (e.g., designed or engineered, for example, genetically engineered) such 
that expression of at least one pantothenate biosynthetic enzyme, and/or at least one 
isoleucine-valine biosynthetic enzyme, is modified. For example, in one embodiment, a 

30 microorganism is manipulated (e.g. , designed or engineered) such that the production of 
at least one pantothenate biosynthetic enzyme, and/or at least one isoleucine/valine 
biosynthetic enzyme is deregulated. In a preferred embodiment, a microorganism is 
manipulated (e.g., designed or engineered) such that it has a deregulated biosynthetic 
pathway, for example, a deregulated pantothenate biosynthesis pathway and/or a 

35 deregulated isoleucine-valine biosynthetic pathway, as defined herein. In another 

preferred embodiment, a microorganism is manipulated (e.g., designed or engineered) 
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such that at least one pantothenate biosynthetic enzyme, and/or, at least pne isoleucine- 
valine biosynthetic enzyme is overexpressed. ' , ■ 

Preferred methods of producing desired compounds (e.g., pantb-cohipounds) in a 
manner independent of precursor feed or alternatively, in a manner substantially 
5 independent of precursor feed, are as follows. In one embodiment, the invention 

features a method of producing pantothenate in a manner independent of precursor feed 
comprising culturing an aspartate-ja-decarboxylase-overexpressing (AaD-O) i 
microorganism having a deregulated isoleucine-valine (z7v) pathway under conditions 
such that pantothenate is produced. In another embodiment, the invention features a 
10 method of producing pantothenate! in a manner independent of precursor feed 
comprising culturing an aspartate-a-decarboxylase-overexpressing (AaD-O) 
microorganism having a deregulated pantothenate (pan) pathway and a deregulated 
isoleucine-valine (//v) pathway, under conditions such that pantothenate is produced. In 

* 

another embodiment, the invention features a method of producing pantothenate in a 

i 1 

15 manner independent of aspartate or p-alanine feed comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that 
pantothenate is produced. In yet another embodiment, the invention features a method 
of producing pantothenate in a manner independent of valine or a-ketoisovalerate feed 
comprising culturing a microorganism having a deregulated isoleucine-valine (ilv) 

20 biosynthetic pathway under conditions such that pantothenate is produced. 

The term "aspartate-a-decarboxylase-overexpressing (AaD-O) microorganism" 
includes a microorganism which has been manipulated such that aspartate-a- 
decarboxylase is overexpressed. A preferred "aspartate-a-decarboxylase-overexpressing 
(AaD-O) microorganism" has been transformed with a vector comprising a B. subtilis 

25 panD nucleic acid sequence (e.g., a panD nucleic acid sequence that encodes an 

aspartate-a-decarboxylase protein having the amino acid sequence of SEQ ID NO:28, 
for example, a panD nucleic acid sequence as set forth in SEQ ID NO:27). 

The phrase "microorganism having a deregulated isoleucine-valine (ilv) 
pathway" includes a -microorganism having an alteration or modification in at least one 

30 gene encoding an enzyme of the isoleucine-valine (/7v) pathway or having an alteration 
or modification in an operon including more than one gene encoding an enzyme of the 
isoleucine-valine (i/v) pathway. A preferred "microorganism having a deregulated 
isoleucine-valine (/7v) pathway" overexpresses acetohydroxyacid synthetase (e.g., 
acetohydroxyacid synthetase having subunits having the amino acid sequences of SEQ 

35 ID NO:32 and SEQ ID NO:34 or acetohydroxyacid synthetase having the amino acid 
sequence of SEQ ID NO:87), acetohydroxyacid isomeroreductase (having the amino 
acid sequence of SEQ ID NO:36), or dihydroxyacid dehydratase (having the amino acid 
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sequence of SEQ ID NO:38) and/or has been transformed with a vector comprising ilvB, 
UvN, ilvC, ilvBN, ilyBNC or alsS nucleic acid sequences (SEQ ID NO:3 1 , SEQ ID 
NO:33, SEQ ID NO:35, nucleotides 1 T 2246 of SEQ ID NO:58, SEQ ID NO:58 having 
coding regions from nucleotides 1-1725, 1722-2246 and 2263-3291, or SEQ ID NO:86, 

5 respectively) and/or an ilvD nucleic acid sequence (SEQ ID NO:37). 

i. i i ' ' 

IV. High Yield Production Methodologies ' 
1 A' particularly preferred embodiment pf the present invention is a high yield 
production method for producing pantothenate comprising culturing a manipulated 
10 microorganism under conditions such that pantothenate is produced at a significantly 
high yield. The phrase "high yield production method", for example, a high yield 

i 

I 

production method for producing a desired compound (e.g., for producing a panto- 
compound) includes a method that results in production of the desired compound at a 
level which is elevated or above what is usual for comparable production methods. 

i t t i 

15 Preferably, a high yield production method results in production of the desired 

compound at a significantly high yield. The phrase "significantly high yield" includes a 
level of production or yield which is sufficiently elevated or above what is usual for 
comparable production methods, for example, which is elevated to a level sufficient for 
commercial production of the desired product (e.g., production of the product at a 

20 commercially feasible cost). In one embodiment, the invention features a high yield 
production method of producing pantothenate that includes culturing a manipulated 
microorganism under conditions such that pantothenate is produced at a level greater 
than 2 g/L. In another embodiment, the invention features a high yield production 
method of producing pantothenate that includes culturing a manipulated microorganism 

25 under conditions such that pantothenate is produced at a level greater than 10 g/L. In 
another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 20 g/L. In yet 
another embodiment, the invention features a high yield production method of 

30 producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 30 g/L. In yet 
another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 40 g/L. 

35 The invention further features a high yield production method for producing a 

desired compound (e.g., for producing a panto-compound) that involves culturing a 
manipulated microorganism under conditions such that a sufficiently elevated level of 
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compound is produced within a commercially desireable period of time,. In an., 
exemplary embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
conditions such that pantothenate is produced at a level greater than 15-20 g/L in 36 
5 hours. In another embodiment, the invention features a high yield production method of 

producing pantothenate that includes culturing a manipulated microorganism under 

1 f i i ■ i 

conditions such that pantothenate ! is produced at a level greater than 25-30 g/L in 48 ■ 
hours. In another embodiment, the invention features a high yield production method of 
producing pantothenate that includes culturing a manipulated microorganism under 
10 conditions such that pantothenate is produced at a level greater than 35-40 g/L in 72 
hours, for example, greater that 37 g/L in 72 hours. In another embodiment, the 
invention features a high yield production method of producing pantothenate that 
includes culturing a manipulated microorganism under conditions such that pantothenate 

is produced at a level greater than ;30-40 g/L in 60 hours, for example, greater that 30, 35 

i 1 

15 or 40 g/L in 60 hours. Values and ranges included and/or intermediate within the ranges 
set forth herein are also intended to be within the scope of the present invention. For 
example, pantothenate production at levels of at least 31, 32, 33, 34, 35, 36, 37, 38 and 
39 g/L in 60 hours are intended to be included within the range of 30-40 g/L in 60 hours. 
In another example, ranges of 30-35 g/L or 35-40 g/L are intended to be included within 

20 the range of 30-40 g/L in 60 hours. Moreover, the skilled artisan will appreciate that 
culturing a manipulated microorganism to achieve a production level of, for example, 
"30-40 g/L in 60 hours" includes culturing the microorganism for additional time 
periods (e.g., time periods longer than 60 hours), optionally resulting in even higher 
yields of pantothenate being produced. 

25 

K Panto-Compound Production Methodologies Featuring Pantothenate 

Kinase Mutant Microorganisms 

The present invention relates to methods of producing pantothenate using 
microorganisms engineered to produce high yields of pantothenate as well as other 

30 panto-compounds. Cells overproducing pantothenate result in high intracellular 

pantothenate levels that could overcome the feedback inhibition of pantothenate kinase 
by CoA, leading to overproduction of CoA. Besides consuming pantothenate, increased 
synthesis of CoA may cause increased feedback inhibition of the PanB, PanD, PanE or 
PanC reaction, thereby limiting pantothenate production. Accordingly, a reduction in 

35 pantothenate kinase activity may lead to a decrease in CoA levels with resulting 
increases in PanB, PanD, PanE or PanC activity and pantothenate production. 
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Thus, certain methodologies of the present invention are based, at least in part, 
on the identification 'and characterizatipn of the B. subtilis coaA gene and the 
demonstration that the gene is neither essential for B. subtilis growth (le., deletion of the 
coaA gene from the chromosome of B. subtilis is not a lethal event) nor for pantothenate 
5 kinase activity in B. subtilis. A second pantothenate kinase-encoding gene has been , 
identified and characterized in B. subtilis, and is termed "coaA 7 '. This gene 

complements an E. coli mutant that contains a temperature sensitive pantothenate kinase 

t 1 i i 

arid is not related by homology to any previously known pantothenate kinase gene. 

i i t 

1 r 

In one aspect, the methodologies of the invention feature recombinant 
10 microorganisms that include the ccxxJf gene or that include a mutant coo^gene, having 
reduced pantothenate kinase activity. In one embodiment, the methodologies feature 
such recombinant microorganisms further having a deregulated pantothenate 
biosynthetic pathway. In another embodiment, the methodologies feature such 
recombinant microorganisms further having a deregulated isoleucine-valine (i/v) 
1 5 pathway. In a preferred embodiment, the microorganisms belong to the genus Bacillus 
(e.g., B. subtilis). 

The methodologies of the invention also feature recombinant microorganisms 
(e.g., microorganisms belong to the genus Bacillus, for example, B. subtilis) that 
include the coaA gene or that include a mutant coaA gene, optionally including a coaX 
20 gene or mutant thereof, having reduced pantothenate kinase activity. In one 

embodiment, the methodologies feature such recombinant microorganisms further 
having a deregulated pantothenate biosynthetic pathway or having a deregulated 
isoleucine-valine (z7v) pathway. Also featured are vectors that include isolated coaXox 
coaA genes as well as mutant coaX and/or coaA genes. Isolated nucleic acid molecules 
25 that include isolated coaA" genes or mutant coaX genes are features in addition to 
isolated CoaX proteins and mutant CoaX proteins. 

The above-described nucleic acid molecules (e.g., genes), proteins, vectors, and 
recombinant microorganisms (e.g. , mutant microorganisms), are particularly suited for 
use in methods of producing panto-compounds and/or methods of enhancing panto- 
30 compound production. In one embodiment, the invention features a method for 

producing a panto-compound (e.g., pantothenate) that includes culturing a pantothenate 
kinase mutant (e.g., a recombinant microorganism that misexpresses, e.g., is mutated 
for, pantothenate kinase, as defined herein) under conditions such that panto-compound 
is produced. In another embodiment, the invention features a method for enhancing 
35 production of a panto-compound (e.g., pantothenate) that includes culturing a 

pantothenate kinase mutant (e.g., a recombinant microorganism that misexpresses, e.g., 
is mutated for, pantothenate kinase, as defined herein) under conditions such that 
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35 



production of the panto-compound is produced. , As used herein, the term "enhancing" . 
(for example, in the context of the phrase "enhancing, production") includes increasing 
the level or rate of production of panto-compound (e.g. , pantothenate) as compared to 
the level or rate of production in a non-mutant microorganism (e.g. , a microorganism 
having a normal pantothenate kinase gene(s) and/or having normal pantothenate 
production rates and/or levels. 1 

Preferably, the level of pantdrcompound produced in methodologies featuring 
the pantothenate kinase mutants of the present invention is increased by at least 5% as 
compared to the level produced by a non-mutant (e.g., a recombinant microorganism 
expressing non-mutated pantothenate kinase). Even more preferably, the level of panto- 
compound is increased 10% as compared to methodologies featuring non-mutants. 
Even more preferably, panto-compound levels (e.g., pantothenate levels) are increased 
20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, are increased 2-fold, 5-fold, 10-fold, 50- 
fold, 1 00-fold or more as compared to methodologies featuring non-mutants. 



VI. 

Production 



Additional Mutations Resulting in Enhanced Panto-Compound 



The methodologies of the present invention further can include, for example in 
addition to overexpressing or deregulating a pantothenate biosynthetic enzyme and/or an 
isoleucine-valine biosynthetic enzyme, or in addition to mutating a pantothenate-kinase 
encoding gene, deleting or mutating an enzyme that catalyzes the conversion of key 
pantothenate biosynthesis substrates or precursors to unwanted or undesirable products. 
For example, mutating the UvE gene (Kuramitsu et al. (1985) J. Biochem. 97:993-999) 
or a homologue thereof (SEQ ID NO:62 or SEQ ID NO:64), thereby limiting the 
conversion of a-ketoisovalerate to valine, in addition to mutating a pantothenate kinase 
encoding enzyme, is predicted to result in even further enhanced or increased production 
of panto-compound. Alternatively, mutating the ansB gene (Sun and Seflow (1991) J. 
Bacteriol 173:3831-3845) or a homologue thereof (SEQ ID NO:66), thereby limiting 
the degradation of aspartate, in addition to mutating a pantothenate kinase encoding 
enzyme, is predicted to result in even further enhanced or increased production of panto- 
compound. Alternatively, mutating the alsD gene (Renna et al (1993) J. Bacteriol 
175:3863-3875) or a homologue thereof (SEQ ID NO:68), thereby limiting the 
conversion of acetolactate to acetoin, in addition to mutating a pantothenate kinase 
encoding enzyme, is predicted to result in even further enhanced or increased production 
of panto-compound. Alternatively, mutating the avtA gene encoding alanine- valine 
transaminase or a homologue thereof, thereby limiting the conversion of ct- 
ketoi so valerate to valine, in addition to mutating a pantothenate kinase encoding 
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enzyme, is predicted to result in even further enhanced or increased production of panto- 
compound. Mutating the avtA gene can include mutating, for example, an avtA gene 
having the nucleotide sequence of SEQ ID NO:70 (e.g., the E. coli avtA gene), or a 
structural homolog thereof (e.g., a homologue encoding a protein having 30-40%, 40- 
5 50%, 50-60%, 60-70%, 70-80%, 80-90%, 90-95% or more identity with the amino acid 
sequence of SEQ ID NO:71) or a functional homologue (e.g., a gene encoding a 

structurally unrelated protein having alanine-valine transmainase activity. Such ( 

* I ■ 

mutations can be accomplished using the methodologies as exemplified in the Examples 

' (e.g., Examples XIII, XV, XVI ank XVII). 

10 Accordingly, in one embodiment, the invention features a method of producing a 

panto-compound which includes culturing a microorganism having a mutant 
pantothenate kinase-encoding gene and which further has a deletion or mutation in an 
avtA, ilvE, ansB, and/or alsD gene, or homologue thereof In another embodiment, the 
invention features a method of producing a panto-compound which includes culturing a 

1 5 microorganism having a mutant pantothenate-kinase encoding gene and a deregulated 
pantothenate biosynthetic pathway enzyme and which further has a deletion or mutation 
in an avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. In another embodiment, 
the invention features a method of producing a panto-compound which includes 
culturing a microorganism having a mutant pantothenate-kinase encoding gene and a 

20 deregulated isoleucine-valine biosynthetic pathway enzyme and which further has a 
deletion or mutation in an avtA, ilvE, ansB, and/or alsD gene, or homologue thereof. 

Mutating the alsD gene can be particularly useful when accomplished in 
conjunction with overexpression or deregulation of the alsS gene, for example, to 
prevent carbon (e.g., acetolactate) from being drawn away from the precursor pool 

25 utilized for a-KIV production. Accordingly, to maximize the contribution of the als 
locus to panto-compound production, it is desirable to disrupt the alsD gene in addition 
to overexpressing the alsS gene. To disrupt the alsD gene, appropriate fragments of the 
als operon, flanking the alsD gene, are amplified by PGR and cloned to provide 
homology for creating the disruptions. A drug resistance gene, such as the cat gene, is 

30 cloned between the flanking DNA fragments in place of the alsD gene, and the 

linearized DNA is transformed into a pantothenate production strain such as PA824, 
selecting for drug-resistance. To overexpress alsS, the alsS coding sequence (e.g., an 
alsS coding sequence that has been engineered by PCR for expression) is cloned into an 
expression vector. Vectors which express alsS (or alternatively, vectors which express 

35 alsS plus ilvC) are the introduced into panto-compound production strains (e.g., the 
pantothenate producing strain PA824). 
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, The methodologies of the present invention further can include, P foj: example in . 

addition to overexpressing or deregulating a pantothenate biosynthetic enzyme and/or an 
isoleucine-valine biosynthetic enzyme, or in addition to mutating a pantothenate-kinase 
encoding gene, deleting or mutating an enzyme that catalyzes the conversion of desired 

5 panto-compounds to unwanted or undesireable downstream products. , 

i * i 

1 * • 1 

1 VIL Isolated Nucleic Acid Molecules and Genes • i 

, Another aspect of the present invention features isolated nucleic acid molecules 

that encode Bacillus proteins (e.g., B. subtil is proteins), for example, Bacillus 

10 pantothenate biosynthetic enzymes (e.g., B. subtilis pantothenate biosynthetic enzymes) 
or Bacillus valine-isoleucine biosynthetic enzymes (e.g., B. subtilis valine-isoleucine 
biosynthetic enzymes). Also featured are isolated coaX and/or cooA nucleic acid 
molecules (e.g., isolated coajt and/or coaA genes) as well as isolated nucleic acid 
molecules that include such coaX and/or coaA nucleic acid molecules or genes. 

15 The term "nucleic acid molecule" includes DNA molecules (e.g., linear, circular, 

cDNA or chromosomal DNA) and RNA molecules (e.g., tRNA, rRNA, mRNA) and 
analogs of the DNA or RNA generated using nucleotide analogs. The nucleic acid 
molecule can be single-stranded or double-stranded, but preferably is double-stranded 
DNA. The term "isolated" nucleic acid molecule includes a nucleic acid molecule 

20 which is free of sequences which naturally flank the nucleic acid molecule (i.e., 

sequences located at the 5' and 3* ends of the nucleic acid molecule) in the chromosomal 
DNA of the organism from which the nucleic acid is derived. In various embodiments, 
an isolated nucleic acid molecule can contain less than about 10 kb, 5 kb, 4kb, 3kb, 2kb, 
1 kb, 0.5 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences which naturally flank 

25 the nucleic acid molecule in chromosomal DNA of the microorganism from which the 
nucleic acid molecule is derived. Moreover, an "isolated" nucleic acid molecule, such 
as a cDNA molecule, can be substantially free of other cellular materials when produced 
by recombinant techniques, or substantially free of chemical precursors or other 
chemicals when chemically synthesized. 

30 The term "gene", as used herein, includes a nucleic acid molecule (e.g., a DNA 

molecule or segment thereof), for example, a protein or RNA-encoding nucleic acid 
molecule, that in an organism, is separated from another gene or other genes, by 
intergenic DNA {i.e., intervening or spacer DNA which naturally flanks the gene and/or 
separates genes in the chromosomal DNA of the organism). A gene may direct 

35 synthesis of an enzyme or other protein molecule (e.g., may comprise coding sequences, 
for example, a contiguous open reading frame (ORF) which encodes a protein) or may 
itself be functional in the organism. A gene in an organism, may be clustered in an 
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operon, as defined herein, said operon being separated from other genes and/or operons 
by the intergenic DjNA. Individual geqes contained within an operon may overlap 
without intergenic DNA between said individual genes. An, "isolated gene", as used 
herein, includes a gene which is essentially free of sequences which naturally flank the 
5 gene in the chromosomal DNA of the organism from which the gene is derived (i.e., is 
free of adjacent coding sequences which' encode a second or distinct protein or RNA 
molecule, adjacent structural sequences or the like) and optionally includes 5' and 3' 
regulator^ sequences, for example promoter sequences and/or terminator sequences. In 

one embodiment, an isolated gene includes predominantly coding sequences for a 

i 

1 0 protein (e.g., sequences which encode Bacillus proteins). In another embodiment, an 
isolated gene incluides coding sequences for a protein (e.g., for a Bacillus protein) and 
adjacent 5* and/or 3' regulatory sequences from the chromosomal DNA of the organism 
from which the gene is derived (e.g., adjacent 5* and/or 3' Bacillus regulatory 
sequences). Preferably, an isolated gene contains less than about 10 kb, 5 kb, 2 kfy 1 kb, 

15 0.5 kb, 0.2 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences which naturally 
flank the gene in the chromosomal DNA of the organism from which the gene is , 
derived. 

In one aspect, the present invention features isolated panB nucleic acid 
sequences or genes, isolated panC nucleic acid sequences or genes, isolated panD 

20 nucleic acid sequences or genes, isolated panE nucleic acid sequences or genes, isolated 
ilvB, ilvN, ilvBN nucleic acid sequences or genes, isolated alsS nucleic acid sequences or 
genes, isolated ilvC nucleic acid sequences or genes and/or isolated ilvD nucleic acid 
sequences or genes. 

In a preferred embodiment, the nucleic acid or gene is derived from Bacillus 

25 (e.g., is 5ac/7/w5-derived). The term "derived from Bacillus" or "Ztac///ws-derived" 

includes a nucleic acid or gene which is naturally found in microorganisms of the genus 
Bacillus. Preferably, the nucleic acid or gene is derived from a microorganism selected 
from the group consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, 
Bacillus firmus, Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, 

30 Bacillus circulans, Bacillus coagulans, Bacillus licheniformis, Bacillus megaterium, 
Bacillus pumilus, Bacillus thuringiensis, and other Group 1 Bacillus species, for 
example, as characterized by 16S rRNA type (Priest, supra). In another preferred 
embodiment, the nucleic acid or gene is derived from Bacillus brevis or Bacillus 
stearothermophilus. In another preferred embodiment, the nucleic acid molecules 

35 and/or genes of the present invention are derived from a microorganism selected from 
the group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens, Bacillus 
halodurans, Bacillus subtilis, and Bacillus pumilus. In a particularly preferred 
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embodiment, the nucleic acid or gene is derived, from Bacillus subtilis (e.g., is Bacillus, 
subtil is-derivcd). The term "derived from Bacillus subtilis" or "Bacillus subtilis- 
derived" includes a nucleic acid or gene which is naturally found in Bacillus subtilis. In 
yet another preferred embodiment, the nucleic acid or gene is a Bacillus gene 
5 homologue (e.g., is derived from a species distinct froni Bacillus but having significant 
homology to a Bacillus gene of the present invention, for example, a Bacillus pan gene 
or Bacillus ilv gene). , i 

Included within the scope of the present inyention are bacterial-derived nucleic 
acid molecules or genes and/or Bacillus-derived nucleic acid molecules or genes (e.g., 
10 B. subtilis-derived nucleic acid molecules or genes), for example, the genes identified by 

the present inventors, for example, Bacillus or B. subtilis coaX genes, coaA genes, pan 

' ' ' . 

genes and/or ilv genes. Further included within the scope of the present invention are 

bacterial-derived nucleic acid molecules or genes and/or Bacillus-derived nucleic acid 

molecules or genes (e.g., B. subtilis-derived nucleic acid molecules or genes) (e.g.,B. 

15 subtilis nucleic afcid molecules or genes) which differ from naturally-occurring bacterial 

and/or Bacillus nucleic acid molecules or genes (e.g., B. subtilis nucleic acid molecules 

i 

or genes), for example, nucleic acid molecules or genes which have nucleic acids that 
are substituted, inserted or deleted, but which encode proteins substantially similar to the 
naturally-occurring gene products of the present invention. In one embodiment, an , 

20 isolated nucleic acid molecule comprises at least one of the nucleotide sequences set 
forth as SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 
31, SEQ ID NO:33, SEQ ID NO:86, SEQ ID NO:35 or SEQ ID NO:37. In another 
preferred embodiment, an isolated nucleic acid molecule comprises at least two, three or 
four of the nucleotide sequences set forth as SEQ ID NO:23, SEQ ID NO:25, SEQ ID 

25 NO:27, SEQ ID NO:29, SEQ ID NO 3 1, SEQ ID NO:33, SEQ ID NO:88, SEQ ID 
NO:35 or SEQ ID NO:37. For example, a preferred isolated nucleic acid molecule of 
the present invention can include the nucleotide sequences of SEQ ID NO:23, SEQ ID 
NO:25 and SEQ ID NO:27, preferably linked such that the proteins encoded by the 
nucleotide sequences of SEQ ID NO:23, SEQ ID NO:25 and SEQ ID NO:27 are each 

30 produced when the isolated nucleic acid molecule is expressed in a microorganism (e.g., 
SEQ ID NO: 59). In another example, a preferred isolated nucleic acid molecule of the 
present invention can include the nucleotide sequences of SEQ ID NO:3 1 and SEQ ID 
NO:33, preferably linked such that the proteins encoded by the nucleotide sequences of 
SEQ ID NO:3 1 and SEQ ID NO:33 are each produced when the isolated nucleic acid 

35 molecule is expressed in a microorganism (e.g., nucleotides 1-2246 of SEQ ID NO:58). 
In another example, a preferred isolated nucleic acid molecule of the present invention 
can include the nucleotide sequence of SEQ ID NO: 86. In another example, a preferred 
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isolated nucleic acid molecule of the present invention can include the nucleotide 
sequences of SEQ ID NO:3 1 , SEQ ID NO:33 and' SEQ ID N<D:3S, preferably linked ' 

such that the proteins encoded by the nucleotide sequences of SEQ ID NO:3H SEQ ID 

■ 

NO:33 and SEQ ID NO:35 are each produced when the isolated nucleic acid molecule is 
5 expressed in a microorganism (e.g., SEQ ID NO:5 ( 8). 

In another embodiment, an isolated nucleic acid molecule of the present 
invention comprises a nucleotide sequence which is at least about 60-65%, preferably at 
least about 70-75%, more preferable at least about 80-85%, and even more preferably at 
1 least about 90-95% or more identical to a nucleotide sequence set forth as SEQ ID 
10 NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID 

NO:33, SEQ ID NO:88, SEQ ID NO:35 or SEQ ID NO:37. In another embodiment, an 
isolated nucleic acid molecule hybridizes under stringent conditions to a nucleic acid 
molecule having a nucleotide sequence set forth as SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ ID NO:88, SEQ 

i { 

15 ID NO:35 or SEQ ID NO:37. Such stringent conditions are known to those skilled in 
the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent (e.g. high 
stringency) hybridization conditions are hybridization in 6X sodium chloride/sodium 
citrate (SSC) at about 45°C, followed by one or more washes in 0.2 X SSC, 0.1% SDS 

20 at 50-65°C. Preferably, an isolated nucleic acid molecule of the invention that 
hybridizes under stringent conditions to the sequence of SEQ ID NO:23, SEQ ID 
NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 31, SEQ ID NO:33, SEQ ID 
NO:88, SEQ ID NO:35 or SEQ ID NO:37 corresponds to a naturally-occurring nucleic 
acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule refers to 

25 an RNA or DNA molecule having a nucleotide sequence that occurs in nature. 

A nucleic acid molecule of the present invention (e.g., a nucleic acid molecule 
having the nucleotide sequence of SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29, SEQ ID NO 3 1, SEQ ID NO:33, SEQ ID NO:88, SEQ ID NO:35 or 
SEQ ID NO:37 can be isolated using standard molecular biology techniques and the 

30 sequence information provided herein. For example, nucleic acid molecules can be 
isolated using standard hybridization and cloning techniques (e.g., as described in 
Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular Cloning: A Laboratory Manual. 
2nd, ed, Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1 989) or can be isolated by the polymerase chain reaction using 

35 synthetic oligonucleotide primers designed based upon the sequence of SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO 3 1 , SEQ ID NO:33, SEQ 
ID NO:88, SEQ ID NO:35 or SEQ ID NO:37. A nucleic acid of the invention can be 
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amplified using cPNA, mRNA or alternatively, genomic DNA, as a template and 
appropriate oligonucleotide primers according to standard PCR amplification 
techniques. In another preferred embodiment, an isolated nucleic acid rnolecule of the 
invention comprises a nucleic acid molecule which is a complement of the nucleotide 
5 sequence shown in SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29; 
SEQ ID NO:33, SEQ ID NO 31, SEQ ID NO:33, SEQ ID NO:88,SEQ ID NO:35. 

Additional panC nucleic acid, sequences include, those, that comprise the 
nucleotide sequence of SEQ ID NO:25, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:26 (e.g., encode a polypeptide having 
10 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:26 and a substantially 

identical activity as said polypeptide), hybridize under stringent conditions, to all or a 

■ i 

portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:25 or 

f i 

to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 

1 5 amino acid sequence of SEQ ID NO:26, or are complementary to a panC nucleotide 
sequence as set forth herein. 

Aditional panD nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:27, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:28 (e.g., encode a polypeptide having 

20 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:28 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:27 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 

25 amino acid sequence of SEQ ID NO:28, or are complementary to a panD nucleotide 
sequence as set forth herein. 

Additional panE nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:29, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:30 (e.g., encode a polypeptide having 

30 at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 
having the amino acid sequence as set forth in SEQ ID NO:30 and a substantially 
identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:29 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 

35 amino acid sequence of SEQ ID NO:30, or are complementary to a panE nucleotide 
sequence as set forth herein, 
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Additional ilvB nucleic acid sequences are those that comprise the nucleotide 

sequence of SEQ ID NO:31, encode a homologue of the polypeptide having the amino 

• . 1 

acid sequence set forth in SEQ ID NO:32 (e.g., encode a polypeptide having at least 

30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or. more identity to the polypeptide having 

5 the amino acid sequence as set forth in SEQ ID NO:32 and a substantially identical 

activity as said polypeptide), hybridize under stringent conditions to all or a portion of a 

nucleic acid molecule having the nucleotide sequence of SEQ ID NO:31 or to all or a , 

portion of a nucleic acid molecule that encodes a polypeptide having the amino acid 
• 1 | ' 

sequence of SEQ ID NO:32, or are complementary to an ilvB nucleotide sequence as set 

10 forth herein. I 

Additional ilvN nucleic acid sequences are those that comprise the nucleotide 
sequence of SEQ ID NO:33, encode a homologue of the polypeptide having the amino 
acid sequence set forth in SEQ ID NO:34 (e.g., encode a polypeptide having at least 
30%, 40%, 50%, 60%, 70%, 80%,|90%, 95% or more identity to the polypeptide having 

15 the amino acid sequence as set forth in SEQ ID NO:34 and a substantially identical 

activity as said polypeptide), hybridize under stringent conditions to all or a portion of a 
nucleic acid molecule having the nucleotide sequence of SEQ ID NO:33 or to all or a 
portion of a nucleic acid molecule that encodes a polypeptide having the amino acid 
sequence of SEQ ID NO:34, or are complementary to an ilvN nucleotide sequence as set 

20 forth herein. 

Additional ilvC nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:35, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:36 (e.g., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 

25 having the amino acid sequence as set forth in SEQ ID NO:36 and a substantially 

identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:35 or 
to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 
amino acid sequence of SEQ ID NO:36, or are complementary to an ilvC nucleotide 

30 sequence as set forth herein. 

Additional ilvD nucleic acid sequences include those that comprise the 
nucleotide sequence of SEQ ID NO:37, encode a homologue of the polypeptide having 
the amino acid sequence set forth in SEQ ID NO:38 (e.g., encode a polypeptide having 
at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 

35 having the amino acid sequence as set forth in SEQ ID NO:38 and a substantially 

identical activity as said polypeptide), hybridize under stringent conditions to all or a 
portion of a nucleic acid molecule having the nucleotide sequence of SEQ ID NO:37 or 
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to all or a portion of a nucleic acid molecule that encodes a polypeptide, having the 

1 * 1 1 1 i 1 i 

amino acid sequence of SEQ ID NO:38, or are complementary to an ilvD nucleotide 

sequence as set forth herein.. 

Additional alsS nucleic acid sequences include those that comprise the 

5 nucleotide sequence of SEQ ID NO:86, encode a homologue of the polypeptide having 

the amino acid sequence set forth in SEQ ID NO:87 (e.g., encode a polypeptide having 

at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more identity to the polypeptide 

having the amino acid sequence as set forth in SEQ ID NO:87 and a substantially 

identical activity as said polypeptide), hybridize under stringent conditions to all or a 

10 portion of a nucleic acid molecule; having the nucleotide sequence of SEQ ID NO:86 or 

to all or a portion of a nucleic acid molecule that encodes a polypeptide having the 

amino acid sequence of SEQ ID NO: 87, or are complementary to an alsS nucleotide 

L 

sequence as set forth herein. 

In another embodiment, an isolated nucleic acid molecule is or includes a coaX 

1 5 gene, or portion or fragment thereof. In one embodiment, an isolated coaX nucleic acid 
molecule or gene comprises the nucleotide sequence as set forth in SEQ ID NO: 19 (e.g., 
comprises the B. subtilis coaX nucleotide sequence). In another embodiment, an 
isolated coaX nucleic acid molecule or gene comprises a nucleotide sequence that 
encodes the amino acid sequence as set forth in SEQ ID NO:9 (e.g., encodes the B. 

20 subtilis CoaX amino acid sequence). In yet another embodiment, an isolated coaX 
nucleic acid molecule or gene encodes a homologue of the CoaX protein having the 
amino acid sequence of SEQ ID NO:9. As used herein, the term "homologue" includes 
a protein or polypeptide sharing at least about 30-35%, preferably at least about 35-40%, 
more preferably at least about 40-50%, and even more preferably at least about 60%, 

25 70%, 80%, 90% or more identity with the amino acid sequence of a wild-type protein or 
polypeptide described herein and having a substantially equivalent functional or 
biological activity as said wild-type protein or polypeptide. For example, a CoaX 
homologue shares at least about 30-35%, preferably at least about 35-40%, more 
preferably at least about 40-50%, and even more preferably at least about 60%, 70%, 

30 80%, 90% or more identity with the protein having the amino acid sequence set forth as 
SEQ ID NO:9 and has a substantially equivalent functional or biological activity (i.e., is 
a functional equivalent) of the protein having the amino acid sequence set forth as SEQ 
ID NO:9 (e.g., has a substantially equivalent pantothenate kinase activity). In a 
preferred embodiment, an isolated coaX nucleic acid molecule or gene comprises a 

35 nucleotide sequence that encodes a polypeptide as set forth in any one of SEQ ID NO:7, 
SEQ ID NO:8, SEQ ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 1 3, SEQ 
ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID 
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NO:74 or SEQ ID NO:75. In another embodiment, an isolated coaX nucleic acid 
molecule hybridizes to all or a portion of a nucleic acid molecule having the nucleotide 
sequence set forth in SEQ ID NO: 19 or hybridizes to all or a portion of a nucleic acid 
molecule having a nucleotide sequence that encodes a polypeptide having the amino 
5 acid sequence of any of SEQ ID NOs:7-l 8, 74 or 75. Such hybridization conditions are 
known to those skilled in the art and can be found in Current Protocols in Molecular 
Biology Ausubel et al. 9 eds., John Wiley & Sons, Inc. (1995), sections 2, 4 and 6. 
AiJditional stringent conditions can be found in Molecular Cloning: A Laboratory 
Manual, Sambrook et a/., Cold Spring Harbor Press, Cold Spring Harbor, NY (1989), 
10 chapters 7, 9 and 1 1. A preferred, non-limiting example of stringent hybridization 

conditions includes hybridization in 4X sodium chloride/sodium citrate (SSC), at about 
65-70°C (or hybridization in 4X SSC plus 50%' formamide at about 42-50°C) followed 

i 

by one or more washes in IX SSC, at about 65-70°C. A preferred, non-limiting 
example of highly stringent hybridization conditions includes hybridization in IX SSC, 

1 5 at about 65-70°C (or hybridization in IX SSC plus 50% formamide at about 42-50°C) 
followed by one or more washes in 0.3X SSC, at about 65-70°C. A preferred, non- 
limiting example of reduced stringency hybridization conditions includes hybridization 
in 4X SSC, at about 50-60°C (or alternatively hybridization in 6X SSC plus 50% 
formamide at about 40-45°C) followed by one or more washes in 2X SSC, at about 50- 

20 60°C. Ranges intermediate to the above-recited values, e.g., at 65-70°C or at 42-50°C 
are also intended to be encompassed by the present invention. SSPE (IX SSPE is 0.15 
M NaCl, lOmM NaH 2 P0 4 , and 1.25 mM EDTA, pH 7.4) can be substituted for SSC (IX 
SSC is 0.15 M NaCl and 15 mM sodium citrate) in the hybridization and wash buffers; 
washes are performed for 1 5 minutes each after hybridization is complete. The 

25 hybridization temperature for hybrids anticipated to be less than 50 base pairs in length 
should be 5-10°C less than the melting temperature (T m ) of the hybrid, where T m is 
determined according to the following equations. For hybrids less than 1 8 base pairs in 
length, T m (°C) « 2(# of A + T bases) + 4(# of G + C bases). For hybrids between 18 and 
49 base pairs in length, T m (°C) - 81.5 + 16.6(log 10 [Na + ]) + 0.41(%G+C) - (600/N), 

30 where N is the number of bases in the hybrid, and [Na + ] is the concentration of sodium 
ions in the hybridization buffer ([Na + ] for IX SSC = 0.165 M). It will also be 
recognized by the skilled practitioner that additional reagents may be added to 
hybridization and/or wash buffers to decrease non-specific hybridization of nucleic acid 
molecules to membranes, for example, nitrocellulose or nylon membranes, including but 

35 not limited to blocking agents (e.g., BSA or salmon or herring sperm carrier DNA), 
detergents (e.g., SDS), chelating agents (e.g, EDTA), Ficoll, PVP and the like. When 
using nylon membranes, in particular, an additional preferred, non-limiting example of 
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stringent hybridization conditions is hybridization in 0.25-0.5M NaH2P04> 7% SDS at . 
about 65°C, followed by one or more Washes at 0.02M NaH 2 P04, 1% SDS at 65°C, see 
e.g., Church anci Gilbert (1984) Proc. Natl Acad. Set USA 81:1991-1995, (or, 
alternatively, 0.2X SSC, 1% SDS). In another preferred embodiment, an isolated 
5 nucleic acid molecule comprises a nucleotide sequence that is complementary to a cdaX 
nucleotide sequence as set forth herein (e.g., is the full complement of the nucleotide 
sequence set forth as SEQ ID NO: 19).» i 

In another preferred embodiment, an isolated nucleic acid molecule is or 
includes a coaA gene, for example, a Bacillus (e.g., B\ subtilis) coaA gene, or portion or 

10 fragment thereof. Exemplary isolated coaA nucleic acid molecules and/or genes include 

» • 

(1) an isolated coaA nucleic acid molecule or gene comprising the nucleotide sequence 
as set forth in any one of SEQ ID NOs:20-22; (2) an isolated coaA nucleic acid molecule 
or gene comprising a nucleotide sequence that encodes the amino acid sequence as set 
forth in SEQ ID NO:3; (3) an isolated coaA nucleic acid molecule or gene comprising a 

15 nucleotide sequence which encodes a CoaA homologue (e.g., a polypeptide having an 
amino acid sequence at least about 30-35%, preferably at least about 35-40%, more 
preferably at least about 40-50%, and even more preferably at least about 60%, 70%, 
80%, 90% or more identical to the amino acid sequence set forth as SEQ ID NO:3 and 
having a substantially equivalent enzymatic activity; (4) an isolated coaA nucleic acid 

20 molecule or gene comprising a nucleotide sequence that encodes a polypeptide as set 
forth in any one of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:5 or SEQ 
ID NO:6; (5) an isolated nucleic acid molecule that hybridizes under stringent 
conditions to all or a portion of a nucleic acid molecule having the nucleotide sequence 
set forth in SEQ ID NO:20, SEQ ID NO:21 or SEQ ID NO:22 or hybridizes to all or a 

25 portion of a nucleic acid molecule having a nucleotide sequence that encodes a 
polypeptide having the amino acid sequence of SEQ ID NO:3; and (6) an isolated 
nucleic acid molecule comprising a nucleotide sequence that is complementary to a 
coaA nucleotide sequence as set forth herein (e.g., is the full complement of the 
nucleotide sequence set forth in SEQ ID NO:20, SEQ ID NO:21 or SEQ ID NO:22). 

30 A nucleic acid molecule of the present invention (e.g., a coaX nucleic acid 

molecule or gene or a coaA nucleic acid molecule or gene), can be isolated using 
standard molecular biology techniques and the sequence information provided herein. 
For example, nucleic acid molecules can be isolated using standard hybridization and 
cloning techniques (e.g., as described in Sambrook, J., Fritsh, E. F., and Maniatis, T. 

35 Molecular Cloning: A Laboratory Manual. 2nd, ed., Cold Spring Harbor Laboratory, 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) or can be isolated 
by the polymerase chain reaction using synthetic oligonucleotide primers designed 
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based upon the coaX or coaA nucleotide sequences set forth herein, or flanking 

sequences thereof. A nucleic acid of the invention (e.g., a coaA'nucleic/acid molecule or 
gene or a coaA nucleic acid molecule or gene), can be amplified using cDHAi mRNA or 
alternatively, chromosomal DNA, as a template and appropriate oligonucleotide primers 
5 according to standard PCR amplification techniques. 

Yet another embodiment of the present invention features mutant coaX and coaA 
nucleic acid molecules or genes. The phrase "mutant nucleic acid molecule" or "mutant 
gene" as used herein, includes a nucleic acid molecule or gene having a nucleotide 
1 sequence which includes at least one alteration (e.g., substitution, insertion, deletion) 

1 0 such that the polypeptide or protein that may be encoded by said mutant exhibits an 
activity that differs from the polypeptide or protein encoded by the wild-type nucleic 
acid molecule or gene. Preferably, a mutant nucleic acid molecule or mutant gene {e.g., 
a mutant coaA or coaX gene) encodes a polypeptide or protein having a reduced activity 
(e.g., having a reduced pantothenate kinase activity) as compared to the polypeptide or 

15 protein encoded by the wild-type nucleic acid molecule or gene, for example, when 

assayed under similar conditions (e.g., assayed in microorganisms cultured at the same 
temperature). A mutant gene also can encode no polypeptide or have a reduced level of 
production of the wild-type polypeptide. 

As used herein, a "reduced activity" or "reduced enzymatic activity" is one that 

20 is at least 5% less than that of the polypeptide or protein encoded by the wild-type 

nucleic acid molecule or gene, preferably at least 5-10% less, more preferably at least 
10-25% less and even more preferably at least 25-50%, 50-75% or 75-100% less than 
that of the polypeptide or protein encoded by the wild-type nucleic acid molecule or 
gene. Ranges intermediate to the above-recited values, e.g., 75-85%, 85-90%, 90-95%, 

25 are also intended to be encompassed by the present invention. As used herein, a 

"reduced activity" or "reduced enzymatic activity" also includes an activity that has 
been deleted or "knocked out" (e.g., approximately 100% less activity than that of the 
polypeptide or protein encoded by the wild-type nucleic acid molecule or gene). 
Activity can be determined according to any well accepted assay for measuring activity 

30 of a particular protein of interest. Activity can be measured or assayed directly, for 
example, measuring an activity of a protein isolated or purified from a cell. 
Alternatively, an activity can be measured or assayed within a cell or in an extracellular 
medium. For example, assaying for a mutant coaA gene or a mutant coo^gene (i.e., 
said mutant encoding a reduced pantothenate kinase activity) can be accomplished by 

35 expressing the mutated gene in a microorganism, for example, a mutant microorganism 
which expresses pantothenate kinase in a temperature-sensitive manner, assaying the 
mutant gene for the ability to complement a temperature sensitive (Ts) mutant for 
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pantothenate kinase activity. A coaX mutant gene or coaA mutant gene, that encodes a 
"reduced pantothenate kinase activity" is one that complements the Ts mutant less 
effectively than, for example, a corresponding wild-type coaXgene or coaA gene. 

It will be appreciated by the skilled artisan that even a single substitution in a 
nucleic acid or gene sequence (e.g., a base substitution that encodes an amino acid 
change in the corresponding amino acid sequence) can dramatically affect the activity of 



an encoded polypeptide or protein 



as compared to the corresponding wild-type' 



polypeptide or protein. A mutant nucleic acid or mutant gene (e.g., encoding a mutant 
polypeptide or protein), as defined herein, is readily distinguishable from a nucleic acid 
10 or gene encoding a protein homologue, as described above, in that a mutant nucleic acid 
or mutant gene encodes a protein or polypeptide having an altered activity, optionally 
observable as a different or distinct phenotype in a microorganism expressing said 
mutant gene or nucleic acid of producing said mutant protein or polypeptide (i.e., a 

mutant microorganism) as compared to a corresponding microorganism expressing the 

i 1 

15 wild-type gene or nucleic acid or producing said mutant protein or polypeptide. By 
contrast, a protein homologue has an identical or substantially similar activity, 
optionally phenotypically indiscernable when produced in a microorganism, as 
compared to a corresponding microorganism expressing the wild-type gene or nucleic 
acid. Accordingly it is not, for example, the degree of sequence identity between 

20 nucleic acid molecules, genes, protein or polypeptides that serves to distinguish between 
homologues and mutants, rather it is the activity of the encoded protein or polypeptide 
that distinguishes between homologues and mutants: homologues having, for example, 
low (e.g., 30-50% sequence identity) sequence identity yet having substantially 
equivalent functional activities, and mutants, for example sharing 99% sequence identity 

25 yet having dramatically different or altered functional activities. Exemplary 

homologues are set forth in Figure 20 (i.e., CoaA homologues) and in Figure 23 (i.e., 
CoaX homologues). Exemplary mutants are described in Examples XV and XVIII 
herein. 



30 VIII. Recombinant Nucleic Acid Molecules and Vectors 

The present invention further features recombinant nucleic acid molecules (e.g., 
recombinant DNA molecules) that include nucleic acid molecules and/or genes 
described herein (e.g., isolated nucleic acid molecules and/or genes), preferably Bacillus 
genes, more preferably Bacillus subtilis genes, even more preferably Bacillus subtilis 

35 pantothenate kinase genes (e.g., coaX genes or coaA genes), pantothenate biosynthetic 
genes (e.g., genes encoding pantothenate biosynthetic enzymes, for example, panB 
genes encoding ketopantoate hydroxymethy ltransferase, panE genes encoding 
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ketopantoate reductase, panC genes encoding pantothenate synthetase, and/or panD 
genes encoding aspartate-ot-decarboxylase) and/or isoleucine-valine (ilv) biosynthetic 
genes (e.g., ilvBNox alsS genes encoding acetohydroxyacid synthetase^ ilvC genes 
encoding acetohydrbxyacid isomeroreductase and/or ilvD genes encoding dihydroxyacid 
5 dehydratase). ■ ( ■ 

The present invention further features vectors (e.g., recombinant vectors) that 
include nucleic acid molecules (e.g., isolated or recombinant nucleic acid molecules 
and/or genes) described herein. In particular^ recombinant vectors are featured that 
include nucleic acid sequences that encode bacterial gene products as described herein, 

1 0 preferably Bacillus gene products, more preferably Bacillus subtilis gene products, even 
more preferably Bacillus subtilis pantothenate biosynthetic gene products (e.g., 
pantothenate biosynthetic enzymes, for examplfe, ketopantoate 
hydroxymethyltransferase, ketopantoate reductase, pantothenate synthetase, and/or 
aspartate-a-decarboxylase) and/or isoleucine-valine biosynthetic gene products (e.g., 

1 5 acetohydroxyacid synthetase, acetohydroxyacid isomeroreductase and/or dihydroxyacid 
dehydratase). 

The term "recombinant nucleic acid molecule" includes a nucleic acid molecule 
(e.g., a DNA molecule) that has been altered, modified or engineered such that it differs 
in nucleotide sequence from the native or natural nucleic acid molecule from which the 

20 recombinant nucleic acid molecule was derived (e.g., by addition, deletion or 
substitution of one or more nucleotides). Preferably, a recombinant nucleic acid 
molecule (e.g., a recombinant DNA molecule) includes an isolated nucleic acid 
molecule or gene of the present invention (e.g., an isolated coaX f coaA, pan or ilv gene) 
operably linked to regulatory sequences. 

25 The term "recombinant vector** includes a vector (e.g., plasmid, phage, phasmid, 

virus, cosmid or other purified nucleic acid vector) that has been altered, modified or 
engineered such that it contains greater, fewer or different nucleic acid sequences than 
those included in the native or natural nucleic acid molecule from which the 
recombinant vector was derived. Preferably, the recombinant vector includes a coaX, 

30 coaA, pan or ilv gene or recombinant nucleic acid molecule including such coaX t coaA, 
pan or ilv gene, operably linked to regulatory sequences, for example, promoter 
sequences, terminator sequences and/or artificial ribosome binding sites (RBSs), as 
defined herein. 

The phrase "operably linked to regulatory sequence(s)" means that the 
35 nucleotide sequence of the nucleic acid molecule or gene of interest is linked to the 
regulatory sequence(s) in a manner which allows for expression (e.g., enhanced, 
increased, constitutive, basal, attenuated, decreased or repressed expression) of the 
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nucleotide sequence, preferably expression of a gene product encoded by the nucleotide; 
sequence (e.g., when the recombinant nucleic acid molecule is included in ,a * 
recombinant vector, as defined herein, and is introduced into a microorganism). 

The term "regulatory sequence" includes nucleic acid sequences which affect 

(e.g., modulate or regulate) expression of other nucleic acid sequences. ( In one 

1 t _ . 

embodiment, a regulatory sequence is included in a recombinant nucleic acid molecule 



or recombinant vector in a similar 



or identical position and/or orientation relative to a 



particular gene of interest as is observed for the regulatory sequence and gene of interest 
as it appears in nature, e.g., in a native position and/or orientation. For example, a gene 



I-" ' 



10 of interest can be included in a recombinant nucleic acid molecule or recombinant vector 
operably linked to a regulatory sequence which accompanies or, is adjacent to the gene 
of interest in the natural organism (e.g., operably linked to "native" regulatory 
sequences, for example, to the "native" promoter). Alternatively, a gene of interest can 
be included in a recombinant nucleic acid molecule or recombinant vector operably 

1 5 linked to a regulatory sequence which accompanies or is adjacent to another (e.g., a 

t ' 

different) gene in the natural organism. Alternatively, a gene of interest can be included 
in a recombinant nucleic acid molecule or recombinant vector operably linked to a 
regulatory sequence from another organism. For example, regulatory sequences from 
other microbes (e.g., other bacterial regulatory sequences, bacteriophage regulatory 

20 sequences and the like) can be operably linked to a particular gene of interest. 

In one embodiment, a regulatory sequence is a non-native or non-naturally- 
occurring sequence (e.g., a sequence which has been modified, mutated, substituted, 
derivatized, deleted including sequences which are chemically synthesized). Preferred 
regulatory sequences include promoters, enhancers, termination signals, anti-termination 

25 signals and other expression control elements (e.g., sequences to which repressors or 
inducers bind and/or binding sites for transcriptional and/or translational regulatory 
proteins, for example, in the transcribed mRNA). Such regulatory sequences are 
described, for example, in Sambrook, J., Fritsh, E. F., and Maniatis, T. Molecular 
Cloning: A Laboratory Manual. 2nd, ed. f Cold Spring Harbor Laboratory, Cold Spring 

30 Harbor Laboratory Press, Cold Spring Harbor, NY, 1 989. Regulatory sequences include 
those which direct constitutive expression of a nucleotide sequence in a microorganism 
(e.g., constitutive promoters and strong constitutive promoters), those which direct 
inducible expression of a nucleotide sequence in a microorganism (e.g., inducible 
promoters, for example, xylose inducible promoters) and those which attenuate or 

35 repress expression of a nucleotide sequence in a microorganism (e.g., attenuation signals 
or repressor sequences). It is also within the scope of the present invention to regulate 
expression of a gene of interest by removing or deleting regulatory sequences. For 
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example, sequences involved in the negatiye regulation of transcription can be removed 

such that expression of a gene of interest is enhance^-. 1 

' i 1 i 

In one embodiment, a recombinant nucleic acid molecule or recombinant vector 
of the present invention includes a nucleic acid sequence or gene that encodes at least 
5 one bacterial gene product (e.g., a pantothenate biosynthetic enzyme, an isoleucine- 
valine biosynthetic enzyme, or a CoaA biosypthetic Enzyme, for Example CoaA or 
CoaX) operably linked to a promoter oi^ promoter sequence. Preferred promoters of the 
present invention include Bacillus promoters and/or bacteriophage promoters (e.g., 
bacteriophage which infect Bacillus). In one embpdiment, a promoter is a Bacillus 

10 promoter, preferably a strong Bacillus promoter (e.g., a promoter associated with a 
biochemical housekeeping gene in Bacillus or a promoter associated with a glycolytic 
pathway gene in Bacillus). In another embodiment, a promoter is a bacteriophage 
promoter. In a preferred embodiment, the promoter is from' the bacteriophage SP01. In 
a particularly preferred embodiment, a promoter is selected from the group consisting of 

1 5 P !5 , P 26 or P veg9 for example, the promoters set forth in SEQ ID NO:39, SEQ ID NO:40 
or SEQ ID NO:4l . Additional preferred promoters include tef (the translational . 
elongation factor (TEF) promoter) and pyc (the pyruvate carboxylase (P YC) promoter), > 
which promote high level expression in Bacillus (e.g., Bacillus subtilis). Additional 
preferred promoters, for example, for use in Gram positive microorganisms include, but 

20 are not limited to, the amyE promoter or phage SP02 promoters. Additional preferred 
promoters, for example, for use in Gram negative microorganisms include, but are not 
limited to tac t trp, tet, trp-tet, lpp t lac, Ipp-lac, laclq, T7, T5, T3, gal, trc, ara, SP6, X-Pr 
or X-Pl. 

In another embodiment, a recombinant nucleic acid molecule or recombinant 
25 vector of the present invention includes a terminator sequence or terminator sequences 
(e.g., transcription terminator sequences). The term "terminator sequences" includes 
regulatory sequences which serve to terminate transcription of a gene. Terminator 
sequences (or tandem transcription terminators) can further serve to stabilize mRNA 
(e.g., by adding structure to mRNA), for example, against nucleases. 
30 In yet another embodiment, a recombinant nucleic acid molecule or recombinant 

vector of the present invention includes sequences which allow for detection of the 
vector containing said sequences (i.e., detectable and/or selectable markers), for 
example, sequences that overcome auxotrophic mutations, for example, ura3 or ilvE, 
fluorescent markers, and/or colorimetric markers (e.g., /acZ/p-galactosidase), and/or 
35 antibiotic resistance genes (e.g., amp or tet). 
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In yet another embodiment, a recombinant nucleic acid molecule or recombinant 
vector of the present invention includes an artificial ribosome binding site (RBS). The 
term "artificial ribosome binding site (RBS)" includes a site within an rhRNA molecule 
(e.g., coded within DNA) to which a ribosome binds (e.g., to initiate translation) which 
5 differs from a native RBS; (e.g., a RBS found in a' naturally-occurring gene) by at least 
one nucleotide. Preferred artificial RBSs include about 5-6, 7-8, 9-10, 1 1-12, 13-14, 15- 
16, 17-18, 19-20, 21-22, 23-24, 25-26; 27-28, 29-30 or more nucleotides of which about 
1-2, 3-4, 5-6, 7-8, 9-10, 1 1-12, 13-15 or more differ from the native RBS (e.g., the 
native RBS of a gene of interest). Preferably, nucleotides which differ are substituted 

10 such that they are identical to one or more nucleotides of an ideal RBS (e.g., SEQ ID 
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47 or SEQ ID NO:48), when 
optimally aligned for comparisons. Artificial RBSs can be used to replace the naturally- 
occurring or native RBS associated with a particular gene. Artificial RBSs preferably 
increase translation of a particular gene. Preferred artificial RBSs (e.g., RBSs for 

15 increasing the translation of panB, for example, of B. subtilis panB) are depicted in 

Table IA (e.g. , SEQ ID NO:49 and SEQ ID NO:50). 

t 

Table I A: Preferred panB Ribosome Binding Sites 

20 10 20 

I 1 

AGAAAGGAGGTGA ideal RBS (SEQ ID NO: 44) 

CCCTCT - AG - AAGGAGG AGAAAACATG RBS1 (SEQ ID NO: 4 9) 

CCCTCT - AG - - AGGAGGAGAAAACATG RBS 2 (SEQ ID NO: 50) 

25 TAAACAT-G-- AGGAGGAGAAAACATG panB native RBS (SEQ ID NO: 42) 



30 



Additional preferred artificial RBSs (e.g., RBSs for increasing the translation of 
panD, for example, of B. subtilis panD) are depicted in Table IB (e.g., SEQ ID NO:51, 
SEQ ID NO:52, SEQ ID NO:53 and SEQ ID NO:54). 

Table IB: Preferred panD Ribosome Binding Sites 



10 

1 



20 
1 



35 CTAGAAAA GGAGGA ATTTAAATG 
TTAAGAAAGGAG GTG ANNNNATG 



pAN423 RBS (SEQ ID NO: 88) 
ideal RBS (SEQ ID NO: 45) 



TTAGAAAGGAGGATXTAAATATG 
TTAGAAA GGAGG TTTAATTAATG 
40 TTAGAAAGGAGG TG ATTTAAATG 
TTAGAAAGGAGG TGTTTAAAATG 
TTAGAAAGGAGGTGANNNNNATG 



new design A (SEQ ID NO: 51) 
new design B (SEQ ID NO: 52) 
new design CI (SEQ ID NO: 53) 
new design C2 (SEQ ID NO: 54) 
ideal RBS (SEQ ID NO: 46) 
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Additional preferred artificial RBSs (e.g., RBSs for increasing the translation of 
panD, for example, of B. subtilis panD) are depicted in Table 1C (e.g., SEQ ID NO:55, 
SEQ ID NO:56 and SEQ ID NO:57). The predicted amino acid sequence at the C- 
terminus of the PanC protein is shown. The start codon for PanD translation is 
underlined. 



p 



Table 1C: Additional Preferred panD Ribosome Binding Sites 

1 ' 10 20 i 

io j i i 



.-A GAA AGG AGG TGA NNN. NNN N ATG ideal RBS (SEQ ID NO:47) 

ATT CGA GAA ATG GAG AGA ATA TAA T ATG native panD RBS (SEQ ID NO: 43) 

He Arg Glu Met Glu Arg He * Met SEQ ID NO: 89 

__A GAA AGG AGG TGA NNN NNN N ATG ideal RBS (SEQ ID NO: 47) 

i 

ATT CGA GAA AGG AGG TGA ATA' TAA T ATG NDI (SEQ ID NO: 55) ( 

He Arg Glu Arg Arg * Met SEQ ID NO: 90 

ATT CGA GAA AGG AGG TGA ATA ATA - ATG NDII (SEQ ID NO: 56) 

He Arg Glu Arg Arg * Met SEQ ID NO: 90 

ATT CGT AGA AAG GAG GTG AAT TAA T ATG NDIII (SEQ ID NO: 57) 

25 He Arg Arg Lys Glu Val Asn * Met SEQ ID NO: 91 

AGA AAG GAG GTG ANN NNN N ATG ideal RBS (SEQ ID NO: 48) 



30 Accordingly, in one embodiment, a vector of the present invention includes an 

artificial RBS as set forth in SEQ ID NO:49 or SEQ ID NO:50. In another embodiment, 
a vector of the present invention includes an artificial RBS as set forth in SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53 or SEQ ID NO:54. In yet another embodiment, 
a vector of the present invention includes an artificial RBS as set forth in SEQ ID 

35 NO:55, SEQ ID NO:56 or SEQ ID NO:57. 

In another embodiment, a recombinant vector of the present invention includes 
sequences that enhance replication in bacteria (e.g., replication-enhancing sequences). 
In one embodiment, replication-enhancing sequences are derived from E. coli. In 
another embodiment, replication-enhancing sequences are derived from pBR322 (e.g., 

40 sequences included within the pBR322 derived portion of any of the pAN vectors as set 
forth in the Figures , i.e., the Not I-Not I sequences from about 5.0 kB to 9.0 kB of the 
vector depicted in Figure 3A). 



WO 01/21772 PCT/US00/25993 

- 50 - 

* . * * 

, In yet another embodiment, a recombinant vector, of the. present.inyention 

includes antibiotic resistance genes. The term "antibiotic resistance geriesV includes 
sequences which promote or confer resistance to antibiotics on the host organism (e.g., 
Bacillus), In one embodiment, the antibiotic resistance genes are selected from the 
5 group consisting of cat (chloramphenicol resistance) genes, let (tetracycline resistance) 
genes, erm (erythromycin resistance) genes, neo (neomycin resistance) genes and spec 
(spectinomycin resistance) genes. Recombinant vectors of the present invention can i 
further include homologous recombination sequences (e.g., sequences designed to allow 
recombination of the gene of interest into the chromosome of the host organism). For 

' s I i ; \ 

1 0 example, amyE sequences can be used as homology targets for recombination into the 
host chromosome. 

Preferred vectors of the present invention include, but are not limited to, vectors 
set forth in Figures 2-15, 17, 1 9, 22, 25 and 26. It will further be appreciated by one of 
skill in the art that the design of a vector can be tailored depending on such factors as the 

i 1 

15 choice of microorganism to be genetically engineered, the level of expression of gene 
product desired and the like. 

IX. Isolated Proteins 
Another aspect of the present invention features isolated proteins (e.g., isolated 

20 pantothenate biosynthetic enzymes and/or valine-isoleucine biosynthetic enzymes and/or 
isolated CoA biosynthetic enzymes, for example isolated CoaA or CoaX). In one 
embodiment, proteins (e.g., isolated pantothenate biosynthetic enzymes and/or valine- 
isoleucine biosynthetic enzymes and/or isolated CoaA biosynthetic enzymes, for 
example isolated CoaA or CoaX) are produced by recombinant DNA techniques and can 

25 be isolated from microorganisms of the present invention by an appropriate purification 
scheme using standard protein purification techniques. In another embodiment, proteins 
are synthesized chemically using standard peptide synthesis techniques. 

An "isolated" or "purified" protein (e.g., an isolated or purified biosynthetic 
enzyme) is substantially free of cellular material or other contaminating proteins from 

30 the microorganism from which the protein is derived, or substantially free from 
chemical precursors or other chemicals when chemically synthesized. In one 
embodiment, an isolated or purified protein has less than about 30% (by dry weight) of 
contaminating protein or chemicals, more preferably less than about 20% of 
contaminating protein or chemicals, still more preferably less than about 1 0% of 

35 contaminating protein or chemicals, and most preferably less than about 5% 
contaminating protein or chemicals. 
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In a preferred embodiment, the protein or gene product is derived from Bacillus, 

(e.g., is Bacillus-derived). The term "derived from Bpcillus" dr "itac/7/w.s-derived" 

• * i • 

includes a protein or gene product which is encoded by a Bacillus gene!. Preferably, the 

gene product is derived from a microorganism selected from the group consisting of 

5 Bacillus subtilis, Bacillus lent imorbus, Bacillus lentus, Bacillus Jirmus, Bacillus 

pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus circulans, Bacillus 

coagulans, Bacillus licheniformis, Bacijlus megaterium, Bacillus pumilus, Bacillus 

thuringiensis, and other Group 1 Bacillus species, for example, as characterized by 16S 

i ' 
rRNA type (Priest, supra). In another preferred embodiment, the protein or gene 

10 product is derived from Bacillus brevis or Bacillus stearothermophilus. In another 
preferred embodiment, the protein or gene product is derived from a microorganism 
selected from the group consisting of Bacillus licheniformis, Bacillus amyloliquefaciens, 
Bacillus haloduraris, Bacillus subtilis, and Bacillus pumilus. In a particularly preferred 
embodiment, the protein or gene product is derived from Bacillus subtilis (e.g., is 

15 Bacillus subtilis-derived). The term "derived from Bacillus subtilis" or "Bacillus 
subtilis-derived" includes a protein or gene product which is encoded by a Bacillus 
subtilis gene. In yet another preferred embodiment, the protein or gene product is 
encoded by a Bacillus gene. homologue (e.g., a gene derived from a species distinct from 
Bacillus but having significant homology to a Bacillus gene of the present invention, for 

20 example, a Bacillus pan gene or Bacillus ilv gene). 

Included within the scope of the present invention are bacterial -derived proteins 
or gene products and/or Bacillus-derived proteins or gene products (e.g., B. subtilis- 
derived gene products) that are encoded by naturally-occurring bacterial and/or Bacillus 
genes (e.g., B. subtilis genes), for example, the genes identified by the present inventors, 

25 for example, Bacillus or B. subtilis coaX genes, coaA genes, pan genes and/or ilv genes. 
Further included within the scope of the present invention are bacterial-derived proteins 
or gene products and/or Bacillus-derived proteins or gene products (e.g., B. subtilis- 
derived gene products) that are encoded bacterial and/or Bacillus genes (e.g., B. subtilis 
genes) which differ from naturally-occurring bacterial and/or Bacillus genes (e.g., B. 

30 subtilis genes), for example, genes which have nucleic acids that are mutated, inserted or 
deleted, but which encode proteins substantially similar to the naturally-occurring gene 
products of the present invention. For example, it is well understood that one of skill in 
the art can mutate (e.g., substitute) nucleic acids which, due to the degeneracy of the 
genetic code, encode for an identical amino acid as that encoded by the naturally- 

35 occurring gene. Moreover, it is well understood that one of skill in the art can mutate 
(e.g., substitute) nucleic acids which encode for conservative amino acid substitutions. 
It is further well understood that one of skill in the art can substitute, add or delete 
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amino acids to a certain degree without substantially affecting the ftinction of a gene 
product as compared with a naturally-occurring gene product, each instance of which is 
intended to be included within the scope of the present invention. 

In a preferred embodiment, an isolated protein of the present invention (e.g., an 
5 isolated pantothenate biosynthetic enzyme and/or an isolated isoleucine-valine 
biosynthetic enzyme and/or an isolated CoaA biosynthetic enzymes, for example 
isolated CoaA or CoaX) has an amino acid sequence shown in SEQ ID NO:3, SEQ ID 
NO:9, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 
NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID NO:87. In other 
10 embodiments, an isolated protein of the present invention is a homologue of the at least 
one of the proteins set forth as SEQ ID, NO:3, SEQ ID NO:9, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID 
NO:36, SEQ ID NO:38 or SEQ ID NO:87 (e.g., comprises an amino acid sequence at 
least about 30-40% identical, preferably about 40-50% identical, more preferably about 
15 50-60% identical; and even more preferably about 60-70%, 70-80%, 80-90%, 90-95% or 
more identical to the amino acid sequence of SEQ ID NO:3, SEQ ID NO:9, SEQ ID 
NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID NO:87, and has an activity that is 
substantially similar to that of the protein encoded by the amino acid sequence of SEQ 
20 ID NO:3, SEQ ID NO:9, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID 
NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:36, SEQ ID NO:38 or SEQ ID 
NO: 87, respectively. 

To determine the percent homology of two amino acid sequences or of two 
nucleic acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can 
25 be introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 
alignment with a second amino or nucleic acid sequence). When a position in the first 
sequence is occupied by the same amino acid residue or nucleotide as the corresponding 
position in the second sequence, then the molecules are identical at that position. The 
percent identity between the two sequences is a function of the number of identical 
30 positions shared by the sequences (i.e., % identity = # of identical positions/total # of 
positions x 1 00), preferably taking into account the number of gaps and size of said gaps 
necessary to produce an optimal alignment. 

The comparison of sequences and determination of percent homology between 
two sequences can be accomplished using a mathematical algorithm. A preferred, non- 
35 limiting example of a mathematical algorithm utilized for the comparison of sequences 
is the algorithm of Karlin and Altschul (1990) Proc. Natl Acad Set USA 87:2264-68, 
modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci USA 90:5873-77. Such 
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an algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) of 
'Altschul et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be ' 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequences homologous to nucleic acid molecules of the invention. BLAST 
5 protein searches can be performed with the XBLAST program, score = 50, wordlength = 
3 to obtain amino acid sequences homologous to protein molecules of the invention. To 
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as 
described in Altschul et al. (\99l\ Nucleic Acids Research 25(17):3389-3402. When 
' utilizing BLAST and Gapped BLAST programs, the default parameters of the respective 

10 programs (e.g., XBLAST and NBLAST) can be used. See http://www.nclii.hlm.nih.gov. 
Another preferred, non-limiting example of a mathematical algorithm utilized for the 
comparison of sequences is the algorithm of Myers and Miller (1988) Comput Appl 
Biosci. 4:1 1-17. Such an algorithm is incorporated into the ALIGN program available, 
for example, at the GENESTREAM network server, IGH Montpellier, FRANCE 

15 (http://vega.igh.cnrs.fr) or at the ISREC server (http://www.ch.embnet.org). When 

utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

i 

In another preferred embodiment, the percent homology between two amino acid 
sequences can be determined using the GAP program in the GCG software package 
20 (available at http://www.gcg.com), using either a Blossom 62 matrix or a PAM250 
matrix, and a gap weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. In yet 
another preferred embodiment, the percent homology between two nucleic acid 
sequences can be accomplished using the GAP program in the GCG software package 
(available at http://www.gcg.com), using a gap weight of 50 and a length weight of 3. 

25 

JC Biotransformations and Bioconversions 

Another aspect of the present invention includes biotransformation processes 
which feature recombinant microorganisms (e.g., mutant microorganisms) and/or 
isolated CoA, pantothenate or isoleucine-valine biosynthetic enzymes described herein. 

30 The term "biotransformation process", also referred to herein as "byconversion 
processes", includes biological processes which result in the production (e.g., 
transformation or conversion) of any compound (e.g., intermediate or product) which is 
upstream of a CoA, pantothenate or isoleucine-valine biosynthetic enzyme to a 
compound (e.g., substrate, intermediate or product) which is downstream of said CoA, 

35 pantothenate or isoleucine-valine biosynthetic enzyme. 
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In one embodiment,, the invention features a biotransformation process for the , 
production of a panto-compound comprising contacting a microorganism which 
overexpresses at least one pantothenate bipsynthetic enzymfe with at least one 
appropriate substrate or precursor under conditions such that said panto-compound is 
5 produced and recovering said panto-compound.- In a preferred embodiment, the 

invention features a biotransformation process for the production of pantoate comprising 
contacting a microorganism which overexpresses ketopiantoate reductase (the panE gene 

product) with an appropriate substrate (e.g., ketopantoate) under conditions such that 

i 1 

pantoate is produced and recovering said pantoate. In another preferred embodiment, 
1 0 the invention features a biotransformation process for the production of pantothenate 

comprising contacting a microorganism which overexpresses ketopantoate reductase and 
pantothenate synthetase with appropriate substrates (e.g., ketopantoate and P-alanine) 

| ; 'I 

under conditions such that pantothenate is produced and recovering said pantothenate. 
In yet another preferred embodiment, the invention features a biotransformation process 

15 for the production of pantothenate comprising contacting a microorganism which 
overexpresses ketopantoate hydroxymethyltransferase, ketopantoate reductase and 
pantothenate synthetase with appropriate substrates (e.g., a-ketoisovalerate and p- 
alanine) under conditions such that pantothenate is produced and recovering said 
pantothenate. Preferred recombinant microorganisms for carrying out the above- 

20 described biotransformations include pantothenate kinase mutants. Conditions under 
which pantoate or pantothenate are produced can include any conditions which result in 
the desired production of pantoate or pantothenate, respectively. 

In yet another embodiment, the present invention includes a method of 
producing P-alanine that includes culturing a microorganism which overexpresses 

25 aspartate-a-decarboxylase under conditions such that P-alanine is produced. Preferably, 
the aspartate-a-decarboxylase-overexpressing microorganism has a mutation in a nucleic 
acid sequence encoding a pantothenate biosynthetic enzyme selected from the group 
consisting of ketopantoate hydroxymethyltransferase, ketopantoate reductase and 
pantothenate synthetase. 

30 The invention further features a method of producing p-alanine that includes 

contacting a composition comprising aspartate with an isolated Bacillus aspartate-a- 
decarboxylase enzyme under conditions such that p-alanine is produced (e.g., an in vitro 
synthesis method). 

The microorganism(s) and/or enzymes used in the biotransformation reactions 
35 are in a form allowing them to perform their intended function (e.g, producing a desired 
compound). The microorganisms can be whole cells, or can be only those portions of 
the cells necessary to obtain the desired end result. The microorganisms can be 
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suspended (e.g., in an appropriate solution such as buffered solutions or media), rinsed 
(e.g., rinsed free of media from culturing the microorganism), acetone-dried, 1 
immobilized (e.g., with polyacrylamide gel or k-carrageienan or on synthetic Supports, 
for example, beads, matrices and the like), fixed, cross-linked or permeablized (e.g., 
5 have permeablized membranes and/or walls such that compounds, for example, 

substrates, intermediates or products can more easily pass through said membrane or 
wall). ' , 

Purified or unpurified CoA biosynthetic enzyme(s) (e.g., CoaA and/or CoaX), 
pantothenate biosynthetic enzyme(s) and/or valine-isoleucine biosynthetic enzyme(s) 

10 can also be used in biotransformation reactions. The enzyme can be in a form that 
allows it to perform its intended function (e.g., obtaining the desired compound). For 
example, the enzyme can be in free form or immobilized. Purified or unpurified CoA 
biosynthetic enzyme(s), pantothenate biosynthetic enzyme(s) and/or valine-isoleucine 
biosynthetic enzyme(s) can be coritacted in one or more in vitro reactions with 

15 appropriate substrate(s) such that the desired product is produced. ■ 

With respect to at least the above-described methodologies (e.g., the production 
methodologies of the present invention), at least one aspect of the invention features the 
folowing: embodiments is which the methods do not use microorganisms of the genus 
Corynebacterium and/or microorganisms of the genus Escherichia', embodiments in 

20 which the methods do not use microorganisms selected from the group consiting of 

Escherichia coli and Corynebacterium glutamicum; embodiments in which the methods 
do not use gram negative microorganisms; embodiments in which the microorganisms 
utilized do not include, express or produce nucleic acid molecules, genes or proteins 
(e.g., biosynthetic emzymes) derived from microorganisms of the genus 

25 Corynebacterium and/or microorganisms of the genus Escherichia; embodiments in 
which the microorganisms to not include, express or produce nucleic acid molecules, 
genes or proteins (e.g., biosynthetic emzymes) derived from microorganisms selected 
from the group consisting of Escherichia coli and Corynebacterium glutamicum. 

30 XI. Screening Assays 

Because CoA is an essential factor in bacteria, proteins (e.g., enzymes) involved 
in the biosynthesis of CoA provide valuable tools in the search for novel anti-biotics. In 
particular, the CoaX protein is a valuable target for identifying bacteriocidal compounds 
because it bears no resemblance in primary sequence to mammalian pantothenate kinase 

35 enzymes. Accordingly, the present invention also provides a method (also referred to 
herein as a "screening assay") for identifying modulators, i.e., candidate or test 
compounds or agents (e.g., peptides, peptidomimetics, small molecules or other drugs) 
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/which bind to CoaX, or have a stimulatory or inhibitory, effect on, for ejcampte, coaX^ . 
expression or CoaX activity. 1 , ' 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which are capable of binding to CoaX proteins or a biologically active 
5 portion thereof. In another embodiment, the invention provides assays for screening 
candidate or test compounds which modulate the activity of CoaX proteins or 
biologically active portions thereof. The test compounds of the present invention can be 
obtained using any of the numerous approaches in combinatorial library methods known 
in the art, including: biological libraries; spatially addressable parallel solid phase or 

! . # # It- 1 ' 

10 solution phase libraries; synthetic library methods requiring deconvolution; the 'one- 
bead one-compound* library method; and synthetic library methods using affinity 
chromatography selection. The biological library approach is limited to peptide 
libraries, while the other four approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam, K.S. (1997) Anticancer Drug 

15 Des. 12:145). S ; 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al (1993) Proc. Natl. Acad. Set. U.S.A. 90:6909; Erb f et 
al (1994) Proc. Natl Acad. Set USA 91:1 1422; Zuckermann et al (1994). J. Med. 
Chem. 37:2678; Cho et al (1993) Science 261:1303; Carrell et al (1994) Angew. Chem. 

20 Int. Ed. Engl 33:2059; Carell et al (1994) Angew. Chem. Int. Ed. Engl 33:2061; and in 
Gallop et al (1994) J. Med. Chem. 37:1233. Libraries of compounds may be presented 
in solution (e.g., Houghten (1992) Biotechniques 13:412-421), or on beads (Lam (1991) 
Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria (Ladner USP 
5,223,409), spores (Ladner USP *409), plasmids (Cull et al (1992) Proc Natl Acad Sci 

25 USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390); (Devlin 
(1990) Science 249:404-406); (Cwirla et al (1990) Proc. Natl Acad. Scl 87:6378- 
6382); (Felici (1991) J. Mol Biol. 222:301-310); (Ladner supra.). 

In one embodiment, an assay is a microorganism-based assay in which a 
recombinant microorganism which expresses a CoaX protein or biologically active 

30 portion thereof is contacted with a test compound and the ability of the test compound to 
modulate CoaX activity is determined. Determining the ability of the test compound to 
modulate CoaX activity can be accomplished by monitoring, for example, intracellular 
phosphopanthoate or CoA concentrations or secreted pantothenate concentrations (as 
compounds that inhibit CoaX will result in a buildup of pantothenate in the test 

35 microorganism). CoaX substrate can be labeled with a radioisotope or enzymatic label 
such that modulation of CoaX activity can be determined by detecting a conversion of 
labeled substrate to intermediate or product. For example, CoaX substrates can be 
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labeled with 32p s 14q or -*H, either directly or indirectly, and the radioisotope detected 

■ 'i 1 * i ' ■ i ■ t i , . , ' i 

by direct counting of radioemmission or by scintillation counting. Determining the 
ability of a compound to modulate CoaX activity can alternatively bedetermined by 
detecting the induction of a reporter gene (comprising a CoA-responsive regulatory 
5 element operatively linked to a nucleic acid encoding a detectable marker, e.g., 

j. 1 A 

luciferase), or detecting a CoA-regulated cellular response. 1 ( 

In yet another embodiment, a screening assay of the present invention is a cell- 
free assay in which a CoaX protein or biologically active portion thereof is contacted 
with a test compound in vitro and the ability of the test compound to bind to or modulate 

10 the activity of the CoaX protein or biologically active portion thereof is determined. In 
a preferred embodiment, the assay includes contacting the CoaX protein or biologically 
active portion thereof with known substrates to form an assay mixture, contacting the 
assay mixture with a test compound, and determining the ability of the test compound to 
modulate enzymatic activity of the CoaX on its substrates. 

15 Screening assays can be accomplished in any vessel suitable for containing the 

microorganisms, proteins, and/or reactants. Examples of such vessels include microtiter 
plates, test tubes, and micro-centrifuge tubes. In more than one embodiment of the 
above assay methods of the present invention, it may be desirable to immobilize either 
CoaX protein or a recombinant microorganism expressing CoaX protein to facilitate 

20 separation of products and/or substrates, as well as to accommodate automation of the 
assay. For example, glutathione-S-transferase/CoaX fusion proteins can be adsorbed 
onto glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
derivatized microtiter plates. Other techniques for immobilizing proteins on matrices 
(e.g., biotin-conjugation and streptavidin immobilization or antibody conjugation) can 

25 also be used in the screening assays of the invention. 

In another embodiment, modulators of CoaX expression are identified in a 
method wherein a cell is contacted with a candidate compound and the expression of 
coaXmKNA or CoaX polypeptide in the cell is determined. The level of expression in 
the presence of the candidate compound is compared to the level of expression in the 

30 absence of the candidate compound (or to a suitable control, for example, an appropriate 
buffer control or standard). The candidate compound can then be identified as a 
modulator of coaX mRNA or CoaX polypeptide expression based on this comparison. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 

35 agent identified as described herein in an appropriate animal model. For example, an 
CoaX modulating agent identified as described herein (e.g., an anti-bactericidal 
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compound) can,b$ used in an infectious, animal model to determine the efficacy, 
toxicity, or side effects of treatment with such an agent. 1 

This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patents, patent applications 
(including U.S. Patent Application Serial No. 09/400,494, filed September 21, 1999 
(pending), provisional U.S. Patent Application Serial No. 60/210^072, filed June 7, 
2000, provisional U.S. Patent Application Serial No. 60/221,938, filed July 28, 2000 and 
provisional U.S. Patent Application Serial No. 60/227,860, filed August 24, 2000, to 
which this application relates) and published patent applications cited throughout this 
application are incorporated herein by reference. 



V 
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EXAMPLES 

General Methodology: 

Strains. Bacillus subtilis strains of the present invention are generally derived 
from either of two strains. The first is variously named "168", "1A1", or "RL-1". The 
5 genotype is trpC2. This strain was derived from the, wild type "Marburg" strain by 
mutagenesis and has been the basis of much of the molecular biology work done on B. 
subtilis. The second strain is PY7|9, a prototrophic derivative of 168 that was made Trp + 
• by transduction from the wild type strain W23. 

i i 

Media. Standard minimal-medium for B. subtilis is comprised of l 1 >£ Spizizen 
10 salts and 0.5% glucose. Standard solid "rich medium" is Tryptone Blood Agar Broth 
(Difco), and standard liquid "rich medium" is VY, a mixture of veal infusion broth and 
yeast extract. For testing production of pantothenate in liquid test tube cultures, an 
enriched form of VY, called "Special VY" or "SVY" is used. For batch fermentations, 
SVYG and PFMG are used. The compositions of these media are given belo\y. 

T 

I 

1 

15 VY, a rich liquid medium: 25 g Difco Veal Infusion Broth, 5 g Difco Yeast 

Extract, 1L water (autoclave). 

TBAB, a rich solid medium: 33 g Difco Tryptone Blood Agar Broth, 1 L water 
(autoclave). 

MIN> a min imal medium: 100 ml 10 x Spizizen salts; 10 ml 50% glucose; 2 ml 
20 10% arginine HO*; 10 ml 0.8% tryptophan**; water to 1 liter. (*In some cases, 
arginine is omitted or replace by sodium glutamate at 0.04% final concentration. In 
general, B. subtilis grows faster in minimal medium when certain amino acids, such as 
arginine, glutamine, glutamate, or proline, are added as an auxiliary nitrogen source. ) 
(**For strains that are tryptophan auxotrophs, tryptophan is routinely added to most 
25 minimal media.) 

10 x Spizizen Salts: 1 74 g K 2 HP0 4 -3H 2 0; 20 g (NH 4 ) 2 S0 4 ; 60 g KH 2 P0 4 ; 10 g 
Na 3 Citrate*2H 2 0; 2 g MgS0 4 *7H 2 0; water to 993 mis; then add 3.5 ml FeCl 3 solution and 
3.5 ml Trace Elements solution. 

FeChSolution: 4 g FeCl 3 *6H 2 0; 197 g Na 3 Citrate-2H 2 0; water to 1 liter (filter 
30 sterilize) 

Trace Elements Solution: 0.1 5 g Na 2 Mo0 4 -2H 2 0; 2.5 g H 3 B0 3 ; 0.7 g 
CoCl 2 -6H 2 0; 0.25 g CuS0 4 -5H 2 0; 1 .6 g MnCl 2 -4H 2 0; 0.3 g ZnS0 4 -7H 2 0; water to 1 liter 
(filter sterilize). 
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SVY, Special VY, a supplemented* rich medium for testing pantothenate 
production in test tube cultures: 25 g Difco Veal Infusion Broth; 5 g Difco yeast 
extract; 5 g sodium glutamate; 2.7 g ammonium sulfate; 740 ml water (autbclave); add 
200 ml 1 M potassium phosphate, pH 7.0; 60 ml 50% glucose. (*For testing 
5 pantothenate production in liquid SV Y test tube cultures, Na a-ketoisoyalerate and/or 0- 
alanine can be added to a concentration of 5 g/L from filter-sterilized stocks.) 

PFMG, a yeast extract based medium used in fer mentors: 20 g Amberex ! 
i 1003™ yeast extract; 5 g sodium glutamate, 2 g ammonium sulfate; 5 g tryptophan; 10 g 
KH 2 P04; 20 g K 2 HP0 4 -3H 2 0; 1 g MgQ 2 -6H 2 0; OA g CaCl 2 '2H 2 0; 1 g soduim citrate; 
10 0.01 g FeSCV7H 2 0; 1 ml trace elements solution; 20 g glucose; add water to 1 L. 

Glucose or other sugars are fed as needed. Feed solutions can contain minerals, defined 
or food grade nutrients. 

* i ■ 

, PF t a chemically defined pantothenate jree medium for testing pantothenate 

i i 

auxotrophy: 100 ml 10 x Spizizeri Salts; 100 ml 1 x Difco Pantothenate Assay Medium; 
15 10 ml 50% glucose; water to 1 liter. 

For pantothenate auxotrophs, 1 mM Na pantothenate is added to both minimal 
and rich media, since there is generally not enough pantothenate in rich media to support 
B. subtilis pan mutants. Amino acids are at 100 mg per liter, when used. 
Selection for antibiotic resistance is done with 5 mg/L chloramphenicol, 100 mg/L 
20 spectinomycin HC1, 15 mg/L tetracycline HQ, or 1 mg/L erythromycin plus 25 mg/L 
lincomycin. 

Pantothenate Assays: Biological assay. The indicator organism, Lactobacillus 
plantarum, requires pantothenate for growth, and responds to low concentrations (ng/L). 

25 Thus, using serial dilutions, a wide range of concentrations can be assayed. 

Commercially available medium (e.g., Pantothenate Assay Medium (PAM), Difco), can 
be used. However, Difco PAM supplemented with pantothenate does not support 
growth to the same level as obtainable using a fresh-mixed version of Pantothenate 
Assay Medium (FM-PAM), made up of the individual components as specified by 

30 Difco, which is accordingly, routinely used instead of the commercial product. 

Before assaying B. subtilis culture supernatants, the B. subtilis cells must be 
either removed or killed. B. subtilis culture supernatants give approximately the same 
pantothenate titer when the supernatants are autoclaved as when they are sterile filtered. 
Accordingly, routine procedures involve autoclaving samples for 5 minutes prior to the 

35 biological assay. 
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Pantothenate Assays : HPLC assay. Pantothenic acid production is measured 

by HPLC with a detector wavelength of 197 nm ancl a reference at 450 nm. The 

procedure is a modification of one recommended by Hewlett-Packard for water soluble 

vitamins. Samples of culture broth are diluted into an equal volume of 60% acetronitrile 

5 (ACN), centrifiiged and filtered. Typically a further 10-fold dilution before analysis 

1 i i i ■ 1 

brings the final dilution to 20-fold. Higher concentrations of prodyct are diluted further. 

Compounds are separated on a CI 8 Phenomenex 5 p. Aqua 250 x 4.6 mm column with 

5% acetronitrile (ACN) in 50 mM Na phosphate buffer at pH 2.5. An ACN gradient 

from 5% to 95% washes the column between every sample. The area of the 

10 pantothenate peak is proportional to the concentration between 5 to 1000 mg/L. Other 

panto-compounds are also separated and quantitated by this method. 

Amino Acid Analysis: HPLC assay. Amino acids present in the fermentation 
medium and throughout the fermentation are measured by HPLC with a detector 
wavelength of 338 nm and a reference at 390 nm. The procedure is a modification of 

1 5 one recommended by Hewlett-Packard for amino acid analysis. Samples of culture 
broth are prepared identically as for the panto-compound analysis. Compounds are 
separated on a CI 8 Hypersil 5\i ODS 200 x 2.1 mm column. Solvent A is 20 mM Na 
acetate buffer at pH 7.2. Solvent B contains 40% ACN and 40% methanol. A gradient 
from 100% Solvent A to 100% Solvent B separates amino acids and washes the column 

20 between every sample. 

Batch Fermentations. Pantothenate producing strains are grown in stirred tank 
fermentors, for example, in CF3000 Chemap 14 liter vessels with 10 liter working 
volumes. Computer control and data collection is by commercial software, for example, 
B. Braun Biotech MFCS software. Fermentations can be batch processes but are 
25 preferably sugar-limited, fed batch processes. Some media components {e.g. ofSVYG 
and PMFG) are added to the fermentor and sterilized in place. Portions of the media are 
sterilized separately and added to the fermentors aseptically. This procedure is well 
known to those familiar with the art. Additional nitrogen sources in feeds are sterilized 
separately and added to the carbon source after cooling. 

30 The initial sugar in the medium is consumed in approximately 6 hours. 

Afterwards, glucose or other sugars are fed with the possible addition of minerals, and 
defined or food grade nutrients. Alternatively, feeds are scheduled based on a consensus 
profile of nutritional requirements from samples taken from earlier fermentations. 
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t After inoculation, agitation is set at a relatively low speed, e.g. 200 rpnv When . 

the dissolved oxygen (p02) falls to 30%, computer control automatically adjusts the 
agitation to maintain a dissolved oxygen concentration between 25 and 30% p02. 

5 EXAMPLE I: Enhanced Production of a Panto-Compound Using Bacteria 
Overexpressing/m/iZJCZ) Gene Products. 

1 This Example describes the cloning of the B. subtilis panBCD operon and the . 

generation of microorganisms overexpressing the panBCD gene products. 

To clone the B. subtilis panBCD operon, a plasmid library of B. subtilis GP275 
10 (a derivative of 168) genomic DNA was transformed in E. coli BM4062 (birA*\ and 
temperature resistant clones were selected at 42°C. By comparison of restriction maps 
to the genome sequence, one particular clone was deduced to contain the B. subtilis birA 

* 4 

gene and the adjacent panBCD genes. This plasmid was named pAN201. 

To overexpress the panBCD operon and produce pantothenate, the native 

1 5 promoter, of the panBCD operon was replaced by either of two strong, constitutive 
promoters derived from the B. subtilis bacteriophage SP01. These two promoters are 
named P 2 e and P iS . In addition, either of two artificial ribosome binding sites (RBSs) 
were used to replace the native panB RBS. These two artificial RBSs (set forth as SEQ 
ID NO:49 and SEQ ID NO:50) were predicted to increase translation of panBCD; their 

20 sequences are shown in Table 1 A. Three such engineered panBCD expression cassettes 
were built into circular plasmids capable of replicating in E. coli. Other features of the 
plasmids include a strong rho-independent transcription terminator from the E. coli 
ribosomal RNA transcription unit, called TiT 2 , a Gram-positive chloramphenicol * 
resistance gene (cat), derived from pC194, and a pair of NotI restriction sites at the 

25 junctions between the E. coli replicon and the segment intended for integration into B. 
subtilis. Three plasmids of this series, pAN004, pANOOS, and pAN006 were 
constructed. pAN004 contains the P 26 promoter, RBS1, and a low copy E. coli replicon. 
pANOOS contains the Pis promoter, which in our experience is not as strong as P 2 6 9 
RBS1 , and the low copy replicon. pAN006 contains the P26 promoter, RBS2, and a 

30 medium copy replicon. 

The three panBCD expression cassettes contained in the above-mentioned three 
plasmids were all ligated to a DNA fragment consisting of sequences that naturally 
occur immediately upstream from the native panB gene and integrated in single copy by 
homologous recombination into the panBCD locus of 5. subtilis strains RL-1 and PY79, 

35 replacing the wild-type operon. This was accomplished in two steps. First a deletion- 
substitution that replaced about two thirds of the panB coding region with a Gram- 
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positive spectinomycin resistance gene (spec) was integrated at panB to yield Spec", 
pantothenate auxotrophs. These intermediate strains were than transformed with the 
panBCD expression cassettes of pAN004, pAN005, and pAN006 after ligating them to a 
DNA fragment containing chromosomal sequences just upstream of panB. Selection of 
5 the incoming cassette was for pantothenate prototrophy. The resulting strains were - 
named PA221 , PA222 and PA223 (from RL T 1), and PA235, PA232 and PA233 (from 
PY79), respectively. An example of a plasriiid that contains the joined upstream 
sequence that is in the integrated strain in PA221 is pAN240 (see Figure 2). The 
nucleotide sequence of pAN240 is set forth as SEQ ID NO:76. 

10 Polymerase chain reaction using appropriate primers was used to verify the 

correct chromosomal structures of these engineered strains. When extracts of strain 
PA221 were examined by SDS-PAGE, two proteins were found to be overexpressed. 
One protein had an apparent molecular weight of 29,000 and the other protein appeared 
to be 39,000 daltons. The 29,000 dalton bands is presumably PanB (predicted molecular 

15 weight of 29,761). The larger protein band presumably represents PanC (predicted size 
3 1 ,960 daltons). 

The ability of these strains to produce pantothenate in test tube cultures was 
assesed as follows. Each strain was grown in SVY medium supplemented with 5 g/L cc- 
ketoisovalerate (ot-KIV) and 5 g/L p-alanine, to ensure that these precursors were not 
20 limiting. Culture supernatants were autoclaved and assayed using the bioassay. 

Relative to the parent strains, RL-1 and PY79, the engineered strains produced about 8- 
to 30-fold more pantothenate, attaining 1 g/L pantothenate in some cases. 
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Table 2. Production of pantothenate; by engineered B. subtilis strains in liquid test 

tube, cultures grown in §VY medium with 5 g/L a-KIV and 5 g/L /3- 
alanine. ' > 



Expt. 



Strain 



Promoter 



RBSat 
pariB 



[pantothenate] 
mg/L 



RL-1 
PA221 



i 1 



PA222 

PA223 

PY79 
PA235 

PA221 



Native 



IS 



26 



Native 



26 



Native 
RBSl 

i 

I 

RBSl 

i 

RBS2 

Native 
RBSl 

RBSl 



30 

990 
790 

250 
250 

790 
790 

40 

930 
860 

1100 
1030 



5 The P 2 6 promoter was about 3- to 4-fold more effective than the P i5 promoter, 

while RBSl and RBS2 were roughly equivalent. Plasmids such as pAN004, pAN005, 
pAN006 can also be recombined as circles into the B. subtilis wild type panBCD locus 
by Campbell-type (single crossover) integration, selecting for chloramphenicol 
resistance at 5 mg/L. Strains obtained in this fashion produce about the same amount of 

10 pantothenate as strains PA221, PA222, and PA223, respectively. pAN004 containing 
the P26 promoter, RBSl and a low copy E. coli replicon, is depicted schematically in 
Figure 3A. The nucleotide sequence of plasmid pAN004 is set forth as SEQ ID NO:93. 
pAN006 containing the P26 promoter, RBS2 and a medium copy E. coli replicon, is 
depicted schematically in Figure 3B. The nucleotide sequence of plasmid pAN006 is set 

1 5 forth as SEQ ID NO:94. The nucleotide sequence of panBCD is set forth as SEQ ID 
NO:59 and the predicted amino acid sequences of PanB, PanC and PanD are set forth as 
SEQ ID NO:24, SEQ ID NO:26 and SEQ ID NO:28, respectively. Methods for 
manipulating Bacilli are described, for example, in Harwood, C.R. and Cutting, S.M. 
(editors), Molecular Biological Methods for Bacillus (1990) John Wiley & Sons, Ltd., 

20 Chichester, England, the content of which is incorporated herein by reference. 
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EXAMPLE II: Enhanced Production of a Panto-Compound Using Bacteria 
Overexpressing the panEl Gene Product -Ketopantoate Reductase. , 1 

This Example describes the cloning of the B, subtilis panEl gene and' the 

J 

generation of microorganisms overexpressing the panEl gene product. 

t 

5 Pan" B. subtilis strains (e.g., B. subtilis mutants blocked in the synthesis of 

pantothenic acid) had previously been isolated, one of which was reported to be affected 

* • i 

in ketopantoate reductase activity (Baigori et ah (1991) J. Bacteriol 173:4240-4242)j 
However, the mutations in these strains were incorrectly mapped to the purE-tre interval 
of the B. subtilis genetic map which does not contain the panE or panBCD { genes. 

10 Furthermore as shown below, a panE mutant does not have a Pan* phenotype as the ilvC 
gene product can substitute for the panE gene product in B. subtilis as in other bacterial 
strains such as E. coli. More recently, the S. typhimuruim panE gene has been located 
and determined to be allelic to apbA, a gene required for anaerobic £urine biosynthesis 
(Frodyma et al (1998) J. Biol Chem. 273:5572-5576). E. coli carries a highly 

15 homologous gene at the same map location. Identification of the panE genes in E. coli 
and S. typhimurium was complicated by the fact that the ilvC gene product, 
acetohydroxy acid isorneroreductase, is also capable of carrying out the ketopantoate 
reductase reaction. As a result, pantothenate auxotrophy is not obtained unless both 
panE and ilvC are mutated. 

20 To identify the B. subtilis panEl gene, the B. subtilis genome was searched using 

the protein sequence of E. coli or S. typhimurium ApbA (PanE), and two open reading 
frames were identified having homology to ApbA, named ylbQ and ykpB. These genes 
were renamed panEl and panE2, due to their proposed function in pantothenate 
biosynthesis. Both panEl and panE2 were cloned as PCR products generated from 

25 RL-1 genomic DNA as a template. Both genes were disrupted by either a 

spectinomycin resistance gene (spec) or a chloramphenicol resistance gene (cat). The 
interrupted genes were each integrated by double crossover into PY79 to give PA240 
(ApanEl::spec) and PA241 (ApanE2: :cat). Neither of these strains were pantothenate 
auxotrophs when tested on panto thenate-free (PF) plates, although PA240 containing 

30 ApanEl::spec grew slightly more slowly on TBAB without added pantothenate than 
with a 1 mM pantothenate supplement. By comparison, a ApanB::spec strain does not 
produce single colonies on TBAB, presumably because B. subtilis has no active uptake 
system for pantothenate. 

It was hypothesized that the B. subtilis gene, ilvC, could function for panE as had 

35 been shown for E. coli. Accordingly, the panEl and panE 2 disruptions were introduced 
into a strain, CU550, which is reported to be trpC2 ilvC4 leuC124. Both the single 
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panEl and the double panEl 9 panE2 disruptants were pantothenate auxotrpphs on PF 
medium. 



Table 3. 



Phenotypes of various panEl and panE2 mutants on rich and defined 
media. ' . 



Strain 



Medium 



Growth*: 

- pan, + pan 



PY79 



PA240 



PA241 



CU550 



PA256 



PA258 



TBAB 
PF 

TBAB spec 
PF 

TBAB cam 
PF 

TBAB 
PF 

TBAB spec 
PF 

TBAB spec, cam 
PF 



+++ 



■+-++ 



I.' < 



■H-+ 



++ 

:+++ 



+++ 



+++ 



*Each represents about 1 mm of colony diameter after overnight at 37°C. 

Thus, mutating both panEl and ilvC results in pantothenate auxotrophy, while 
mutating only panEl does not, similar to what has been reported for E.coli and S. 
1 0 typhimurium. 

Next, the quantitative effect of panEl and panE2 knockouts in a pantothenate 
overproducing strain (PA235 described herein) was examined. The panEl and panE2 
disruptions were introduced into PA235, either singly or together to produce PA245 
(ApanE 1 ::spec), PA248 (ApanE2::cat) and PA244 (ApanEl::cat 9 Ap<mE2::spec). The 
15 effect of each mutation on pantothenate production was then tested in liquid test tube 
cultures. 
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Table 4. Pantothenate production by PA235 derivatives containing pan El and 

i j . . ,i .. . f ; . 

panE2 disruptions. ( i 



< i 



Strain [pan] mg/L %ofPA235 

PA235 
PA235 
PA245 

PA245' , 
PA248 
PA248 
PA244 
PA244 



.990 


(100) 


940 


95 


59 


1 1 6 


82 


8 


1060 


106 


1030 


104 

i 


25 


3 


50 


5 



Thus, deletion analysis indicated that the panEl gene contributes to over 90% of 
5 the pantothenate production, while deletion of panE2 did not have a significant effect on 
pantothenate production. It is therefore concluded that panEl accounts for most, but not 
necessarily all, of the ketopahtoate reductase activity in B. subtilis. The rest of the 
ketopantoate reductase activity is predicted to be supplied by /7vC. 

i 

Having identified panEl as an important gene for pantothenate production, 

1 0 increased panEl expression was tested to determine whether it could enhance 
pantothenate production in strains such as PA221 or PA235. The panEl coding 
sequence was installed downstream of the P26 promoter and RBS2 in a vector, pOTP61, 
designed to integrate and amplify at either the bpr locus (a non-essential protease gene) 
or at the locus of the cloned insert. The resulting plasmid, pAN236 (Figure 4) was 

15 transformed into PA221 , selecting for resistance to tetracycline at 15 mg/L. The 
nucleotide sequence of pAN236 is set forth as SEQ ID NO:77. One transformant, 
named PA236 was chosen for further study. 

PA236 was shown to overexpress a protein of about 31,000 daltons, which is 
close to the expected molecular weight of 33,290 daltons for panEl protein. Briefly, 

20 whole cell extracts were prepared from P Y79, RL- 1 , P A22 1 , PA22 1 /pOTP6 1 and PA236 
(2 samples). Cell extracts were separated by gel electrophoresis and the gels were 
coomassie stained to visualize proteins. In cells engineered to overexpress panE 
(PA236-1 and PA236-2), a band was visible having an approximate molecular weight of 
-3 1,000 daltons (as compared to molecular weight markers). Moreover, PA221 and 

25 PA236 expressed increased levels of a -29,000 dalton band, corresponding to the panB 
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gene product, and a -39,000 dalton band, presumably corresponding the panC gene 
product. Furthermore, E. coli transforrhed with pAN006 (Figure 3B) expressed bands 
correlating to the panB and panC gene products and E. coli transfected With PAN236 

expressed a -3 1,000 dalton band corresponding to the panE gene product. 

t * ' * 

Next, PA236.was compared to PA221 carrying the empty vector pOTP61 for 

pantothenate production in liquid test tube cultures supplemented with 5 g/L p-alanine 

and 5 g/L a-KIV, •' 1 . . 



10 



Table 5. 



Effect of overexpression of pariEl and panE2 on pantothenate 
production by engineered strains in liquid test tube cultures. 



Strain 



Additional 
Plasmid 



Gene [Pantothenate] 
Overexpressed mg/L 



PA221 pOTP61 



PA236 pAN236 



PA238 pAN238 



none 



panEl 



panE2 



1,000 
940 
2,030 
2,050 
530 
680 



Overexpression of panEl caused a two-fold increase in pantothenate production 
when compared to the parent strain (e.g., to slightly over 2 g/L) whereas overexpression 
of panE2 resulted in a strain that produced about 35% less pantothenate than the parent 
15 strain. The panEl nucleotide sequence and predicted amino acid sequence are set forth 
as SEQ ID NO:29 and SEQ ID NO:30. 

EXAMPLE III: Enhanced Production of a Panto-Compound by Culturing 
Bacteria Overexpressingpa/i£ , 7 or panBCD in the Presence of Valine. 

20 The ability of valine to function as a media supplement {e.g., as a substitute for 

a-KIV) in strains engineered to overexpress the panBCD operon and panEl was 
evaluated. Valine is closely related to a-KIV by transamination, is less expensive than 
a-KIV, and is commercially available in kilogram quantities. Valine was substituted for 
a-KIV in the standard liquid test tube cultures in SVY medium. The concentration of 

25 valine was varied from 5 to 50 g/L. Although valine at 5 g/L was slightly less effective 
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than cc-KIV in promoting pantothenate production, valine at 10 or 20 g/L equaled or 
surpassed 5 g/L a-KIV in promoting pantothenate production. 1 

» ■ - ' 

EXAMPLES IV-X Generation of Microorganisms Capable of Producing 
5 Pantothenate in a Precursor-Independent Manner 

B. subtilis strains such as PA221 and PA235 (engineered to overexpress 
panBCD) and PA236 (engineered' to overexpress panBCD and panEJ) need to' be fed a- 
ketpisovalerate (a-KIV) (or valine) and aspartate (or p-alanine) to achieve maximal 
pantothenate production, as both these precursors are limiting for pantothepate 
10 synthesis. Accordingly, manipulated microorganisms were designed to eliminate the 
need to feed limiting precursors of pantothenate biosynthesis in the production of 
pantothenate. These strains are also useful in the production of various pantothenate 

biosynthetic pathway intermediates. 

i 

! . 

15 EXAMPLE IV: Generation of Microorganisms Capable of Producing 
Pantothenate in an Aspartate- (or p-AIanine) Independent Manner 

The pant) gene was cloned into B. subtilis expression vector pOTP61 to 
construct pAN423 (Figure 5). The nucleotide sequence of pAN423 is set forth as SEQ 
ID NO:78. The Noil restriction fragment containing panD was isolated from pAN423, 
20 self Hgated and used to transform PA221 . Transformants resistant to Tet 15 , Tet 30 , and 
Tet 60 were isolated and saved for further analysis. 

Six of the pAN423 transformants plus two control transformants were grown in 
SV Y containing 5 g/1 a-KIV with and without 1 0 g/1 aspartate and then assayed for 
pantothenate production (Table 6). 

25 

Table 6. Effect of overproducing PanD on pantothenate production with and without 
added aspartate. 



Culture* 
(PA221 transformants) 


Asp 
(10 g/L) 


TetR** 
(Hg/ml) 


OD550 


[pan] 
(mg/L) 












pOTP61-l 




60 


8.0 


76 


pOTP61-2 




60 


7.7 


91 


423#1-1 




15 


8.5 


180 


423 #1-2 




15 


8.0 


150 


423#l-3 




30 


8.3 


220 


423#l-4 




30 


8.5 


280 


423#l-5 




60 


8.9 


580 


423#l-6 




60 


8.8 


280 
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1 1 


i 




1 

1 




pOTP61-l . ' i 


+ • 

J 


60 


7,5 


. 380 


pOTP61-2 


+ 


60 


6.9 


560 


423#1-1 


+ 


15 


8.5 


1200 


423#l-2 


+ 


15 


8.6 


1000 


423#l-3 


+ 


30 


8.8 


1200 


423#l-4 


+ , 


30 


9.0 


■■ 1200 


423#l-5 


+ 


60 


9.0 


■ 1200 


423#l-6 


. t, 


60 


9.0 


1200 



*Test tubes cultures were grown in SVY + otrKIV (5 g/L) with Asp (10 g/L) where 
indicated. i 

i * 



**TetR — Approximate Tet-resistance of transformant 

The pAN423 transformants produced at least twice the amount of pantothenate 
5 as the controls (i.e., to a level at or near that which was obtained in earlier experiments 
by the addition of p-alanine to the culture medium). The data also show that in the 
absence of added aspartate, transformants containing additional copies of the panD gene 

I I * i 

expression cassette produce more pantothenate than the control transformants. One of 
the transformants, 423#l-5, produced about five times as much pantothenate as the 
10 controls. These results indicated that increased levels of PanD protein "pull" the 
conversion of available aspartate towards p-alanine, and that increasing panD gene 
expression can result in enhancement of pantothenate production both in the presence 
and absence of added aspartate. 

Transformant 423#l-5 was re-named strain PA401 and studied further in shake 
15 flask fermentations. The shake flask medium was SVY with maltose instead of SVY 
with glucose. Results of shake flask experiments agreed well with test tube experiments 
during the first 24 hours. In shake flask experiments without the addition of p-alanine, 
PA401 produced approximately 1 .5 g/1 of pantothenate in 24 hours. Addition of p- 
alanine to the culture medium did not further improve pantothenate titers (Table 7), 
20 indicating that with this strain and these fermentation conditions, P-alanine is not 

limiting pantothenate production. In fact, when no P-alanine is fed, one can observe that 
PA401 is secreting P-alanine in significant amounts into the medium. 
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Table 7 Shake flask cultures with strain PA 401 (panD) with and without fi-alanine. 





Amino acids (g/l) 


i , i 

24 hours ' 


Initial 
, P-ala 
Added 


P-ala Val 

l 


PH 

i 


OD600 

1 


Pantothenate 
(g/l) 


0 

1 


0.7 


1.5 


7.5 


13.7 , 


1.5 1 - . 

i 

t 

i 


5 g/l 


7.1 


1 4 


7.6 


12.4 


1.5 



i 



Each value represents the average of duplicate 250 ml baffled flasks containing 50 ml of 
5 medium, incubated at 37°C with shaking (200 rpm). 
Base Medium: SVY with 10 p/1 a-KIV, 30 g/l maltose 
2% Inoculum: SVY with Tet, grown 24 hours. 

! 

EXAMPLE V: Engineering the panD gene for Further Increased Synthesis of 
10 Aspartate Decarboxylase and Enhanced Production of Pantothenate 

This Example describes the generation of improved ribosome binding sites 
(RBSs) in the panD gene to increase the translation of panD mRNA. 

Increasing the translation of the panD gene mRNA by generation of synthetic panD 
RBSs 

1 5 The RBS (SEQ ID NO:88) used to express panD in pAN423 is a synthetic RBS 

and has been used to successfully produce other proteins in B. subtilis at a high level. 
However, it contains six mismatches when aligned to the "ideal" B. subtilis RBS (SEQ 
ID NO:45) (e.g., an RBS having a sequence which is complementary to the 16S RNA 
sequence within the B.subtilis ribosome). (See e.g., Table IB, mismatches in bold), 

20 Two new RBSs were designed to more closely mimic the ideal RBS. These synthetic 
RBSs, named new design A (NDA) and new design B (NDB) (also referred to herein as 
RBS3 and RBS4), are set forth as SEQ ID NO:51 and SEQ ID NO:52 and are aligned 
with the ideal RBS in Table IB. 

Oligonucleotides corresponding to the top and bottom strands of each new RBS 
25 were synthesized, annealed, then used to replace the RBS in pAN420, generating 

plasmids pAN426 and pAN427. These constructions are illustrated in Figure 6. The 
presence of the NDA and NDB RBS in pAN426 and pAN427 was confirmed by DNA 
sequence analysis. Next, the panD genes from pAN426 and pAN427 were transferred 
to B. subtilis expression vector pOTP61 as shown in Figure 7, creating pAN428 and 
30 pAN429. The nucleotide sequence of pAN429 is set forth as SEQ ID NO:79. 
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, Noil restriction fragments lacking the E. cpli vector sequences wer^ isolated 

' I-'*''. \ ' 

from pAN428 and pAN429, self-ligated, and used to transform strain PA221 to 

1,1 

resistance to Tet 15 . Four isolates resistant to Tet 60 were picked from each transformation 
and assayed for pantothenate and P-alanine production along with PA221 transformed 
with the empty vector (pOTP61) and PA221 transformed \yith pAN423 (strain PA401) 
(see Table 8). 



Table 8. 



Panthothenate production by test tube cultures of PA221 transformed 
with pAN 428 and p X AN 429 



Plasmid 




r 

f 

Medium 


OD550 


Pan 


p-Ala 






ouppiemenis 

j 




n/i ' 
g/1 




pOTP6-l 




t : 

• 

5 

a-Kiv : 


■ 

\M 


■ ■ m I m _r» 




pAN423 




a-KIV 3 I 


10 


0.4 


0.04 

i 


pAN428-l 


* 


a-KIV 5 ■■ 


12 


0.6 


0.04 


pAN428-2 


t 


5 

a-KIV 


1 1 


0.5 


0.03 


pAN428-3 




a-KIV 


1 1 


0.3 


0.03 


pAN42o-4 




a-KIV 


10 


0.1 


UND 


pAN429-l 




a-KIV 5 


12 


0.6 


0.04 


pAN429-2 




a-KIV 5 


11 


0.5 


0.04 


pAN429-3 




a-KIV 5 


11 


0.6 


0.05 


pAN429-4 


# 


a-KIV 5 


12 


0.8 


0.10 


pOTP61 




a-KIV 5 + Asp 10 


11 


0.5 


0.08 


pAN423 




5 10 
a-KIV + Asp 


12 


0.9 


1.32 


pAN428-l 


* 


a-KIV 5 + Asp 10 


12 


0.8 


1.97 


pAN428-2 




5 10 
a-KIV J + Asp 


12 


0.8 


1.51 


pAN428-3 




5 10 
a-KIV - + Asp 


12 


0.9 


1.02 


pAN428-4 




5 10 
a-KIV J + Asp 


11 


0.8 


0.30 


pAN429-l 




5 10 
a-KIV 0 + Asp 1 u 


12 


0.8 


1.78 


pAN429-2 




5 10 
a-KIV + Asp 


12 


0.8 


1.66 


pAN429-3 




5 10 

a-KIV + Asp 


12 


0.8 


1.78 


pAN429-4 


# 


5 10 
a-KIV J + Asp ,v 


13 


0.8 


2.28 



UND: Below the limits of detection. * Renamed PA402 # Renamed PA403 

When grown in medium supplemented with a-KIV at 5 g/1 (a-KIV 5 ), the 
pAN428-l transformant and all four of the pAN429 transformants produced more 
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pantothenate than did PA401, suggesting that these transformants contain higher levels 
of aspartate decarboxylase activity. When grown in medium supplemented with a-KIV 5 
and Asp 10 none of the pAN428 or pAN429 transformants produced more pantothenate 

i 

than PA401. However, the pAN428-l transformant and all four of the pAN429 

transformants produced significantly more P-alanine than did PA401. It is possible that 

i i i * ■ ■ 1 

the excess ^-alanine produced from added aspartate causes inhibition of pantothenate 

production. Alternatively, p-alanine may accumulate because pahtoate is limiting in 

these strains. 

The strains that produced the highest level of p-alanine, the pAN428-l and 
pAN429-4 transformants, were renamed PA402 and PA403, respectively. These two 
strains were grown in SVY medium supplemented with various intermediates and 
reassayed for pantothenate and 6-alanine production. PA221 and PA401 were included 
as controls. The results of the assays are presented in Table 9. 



1 5 Table 9. 



Pantothenate production ofPA402 and PA403 in test tube cultures. 



Strain 



Medium 
Supplements 



OD550 



Pan 

g/1 



P-Ala 

g/1 



Val 

g/1 



PA221 
PA401 
PA402 
PA403 



PA221 
PA401 
PA402 
PA403 



a-KIV; 
a-KIV" 
a-KIV' 
a-KIV : 



7.9 
8.7 
8.5 
9.4 



UND 
0.3 
0.5 
0.7 



a-KIV + Asp 

5 10 
a-KIV + Asp 1 u 

a-KIV 5 + Asp 10 

5 10 
a-KIV + Asp 



9.8 
9.1 
9.4 
9.7 



0.4 
0.8 
0.8 
0.7 



UND 
0.04 
0.04 
0.07 



0.1 1 
1.15 
2.02 
2.40 



0.9 
0.9 
0.9 
0.9 



0.8 
0.8 
0.8 
0.8 



PA221 
PA401 
PA402 
PA403 



Pantoate; 
Pantoate; 
Pantoate] 
Pantoate' 



8.9 
8.7 
10.6 
10.5 



UND 
0.3 
0.5 
0.7 



UND 
0.02 
0.02 
0.02 



0.2 
0.2 
0.2 
0.2 



PA221 
PA401 
PA402 
PA403 



Pantoate, + Asp^ 

Pamoate, + A s Pjq 
Pantoate ^ + Asp j ^ 
Pantoate + Asp 



9.5 
9.2 
9.1 
10.2 



0.4 
2.2 
2.8 
2.9 



0.06 
0.62 
1.17 
1.58 



0.2 
0.2 
0.2 
0.2 
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UND: Below the limits of detection. , 

When grown in medium supplemented with 1 either a-KiV 5 or Pantoate 5 , PA402 
and PA403 produced significantly more pantothenate than did PA401. As before, even 
though PA402 and PA403 produced significantly more Pralanine than PA401 when 
5 grown in medium supplemented with a-KIV 5 and Asp 10 , they did not produce a 

proportional increase in pantothenate. However, when grown in medium supplemented 
with Pantoate 5 plus Asp 10 , both PA402 and PA403 produced significantly more 
pantothenate than PA401 , about a 30% increase. ( 

j ■ 

It can be concluded from these experiments that the improved NDA and NDB 
10 panD ribosome binding sites, engineered into pAN428 and pAN429, respectively, lead 
to increased levels of aspartate decarboxylase activity. 

Increasing the translation of the panD gene mRNA by generation of synthetic panD 

i i * 

RBSs within the panBCD operon 

15 

l 

The native B. subtilis panD gene, ribosome binding site (RBS) (SEQ ID NO:43), 
which is found in the P2$panBCD operon cassette present in PA221 (and in other 
engineered pantothenate production strains described herein), is shown in Table 1C 
aligned with the ideal ribosome binding site (SEQ ID NO:47). The alignment shows 

20 mismatches between the native B. subtilis panD gene RBS, which is located within the 
coding sequence for PanC, and the the ideal RBS. Three new RBSs (within the P26 
panBCD operon cassette) were generated to increase translation of the panD gene 
mRNA and to yield increased synthesis of aspartate decarboxylase. These synthetic 
RBSs (termed NDI, NDII, and NDIII, also referred to herein as RBSS, RBS6 and RBS7, 

25 respectively) are set forth as SEQ ID NO:55, SEQ ID NO:56 and SEQ ID NO:57, 

respectively) and are included in Table 1C. It should be noted that although changes in 
the panD RBS within the panBCD operon also changes the C-terminal amino acid 
sequence of the PanC protein encoded by that operon, an alignment of known and 
suspected PanC protein amino acid sequences showed that the sequence of the last nine 

30 amino acids of the B. subtilis PanC protein could be altered without affecting any 
conserved amino acid residues indicating that such changes should not reduce 
pantothenate synthetase activity or expression. The new RBSs were synthesized and 
incorporated into the P26 panBCD operon expression cassette as follows. 

First, PCR primers were designed to contain the following elements: (1) a 
35 nucleic acid sequence encoding the first five amino acids of PanD up to and including a 
unique BsiWl restriction site that had been previously introduced into panD by PCR; (2) 
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a stop codon for panC, (3) at least one synthetic RBS; and (4) 30-39 bp of nucleic acid 
sequence having 100% identity with panC upstream of the panD RBS. The printers 

were named TP102, TP103, and TP 10.4* and contain the' NDI, NDII, and NDIII ribosome 

> 

binding sites, respectively. These three primers were used in conjunction with the 5' 
5 primer TP 101, which hybridizes near the start codon of panC, in three independent PGR 
reactions to generate the NDI, NDII, and NDIII PCR products. The PCR products were 
purified, digested with Xbal, then' cloned into plasmid vector pASK-lBA3 which had, 
been digested with Xbal and Sma\. The resulting plasmids were named pAN431, 

pAN432, and pAN433. The construction of pAN431 is illustrated in Figure 8 and is 

t i* . 

10 representative of all three plasmicl constructions. The presence of the desired synthetic 
panD gene RBS in each new plasmid was confirmed by DNA sequencing. 

Next, the modified panC genes containing the new panD RBSs were joined with 

i i 

the panD gene utilizing the unique BsiWl restriction site. This was accomplished by 
isolating the appropriate Nsil-BsiWl. restriction fragments from pAN431, pAN432, and 
15 pAN433 and ligating them with a 2395 bp Nsil-BsiVSl restriction fragment from 

pAN420, which supplied the 55/WI-modified panD gene. These constructions resulted 
in plasmids pAN441, pAN442, and pAN443, respectively. A representative 
construction (pAN441) is illustrated in Figure 9. The nucleotide sequence of pAN443 is 
set forth as SEQ ID NO:80. 

20 The new panD gene RBSs were then substituted into the P26panBCD operon 

expression cassette as follows. First, a deletion-insertion mutation which removes the 
region of panC containing the panD RBS was created. This was constructed by 
digesting pAN430 with a mixture of BspEl and BgRl and recovering the 4235 bp 
fragment which is now missing the 3' end of panC and the 5' end of panD. This 

25 fragment was ligated with anAval-BamHl restriction fragment from plasmid pECC4, 
which contains the chloramphenicol acetyl transferase (cat) gene. The 5' extension 
produced by Aval digestion is compatible with that produced by BspEl while the Bglll 
and BamHl extensions are also compatible. The resulting plasmid was named pAN440, 
and its construction is illustrated in Figure 10, 

30 The resulting deletion-insertion mutation was crossed into the P26 panBCD 

operon via homologous recombination by transforming PA221 with linearized pAN440 
and selecting for resistance to chloramphenicol on Cam 5 plates containing 1 mM 
pantothenate. Several transformants were tested, and were all found to require 1 mM 
pantothenate for growth, as expected. Two of these transformants were remaned 

35 PA408A and PA408B and were assayed for pantothenate production. Neither strain 
synthesized measurable quantities of pantothenate, even when grown in medium 
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containing pantoate and P-alanine at 5 g/1, indicating that the strains are, deficient in 
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pantothenate synthetase activity. Next, the new panD RBSs were crossfed into the P26 

t , t 

panBCD operon by transforming PA408 with linearized pAN441, pAN442, and 
pAN443 plasmid DNA and selecting for growth on TB AB plates without pantothenate 
5 supplementation. A transformation with linearized pAN43,0 (including the native panD 
RBS) was included as a control and was expected to give rise to transfdrmants identical 
to PA221 described herein. Four isolates from each transformation were assayed for t 
pantothenate and p-alanine production in SVY medium supplemented with various 
intermediates (Tables 10 and 1 1). 1 

10 

Table JO. Pantothenate production of PA410 - PA413 in test tube cultures. 



Strain 


RBS 


Medium 


O»550 


Pan 


P-Ala 






Supplements 

! 1 
1 




g/i 


g/1 


PA221 


native 


1 

Pantoate 


11 


UND 


UND 


PA410-1 


native 


Pantoate^ 


12 


UND 


UND 


PA410-2 




Pantoate 


12 


UND 


UND 


PA4 1 0-3 




Pantoate, 


12 


UND 


UND 


PA4 1 0-4 




Pantoate 


12 


UND 


UND 


PA411-1 


NDI 


Pantoate^ 


12 


0.23 


UND 


PA411-2 




Pantoate. 


12 


0.20 


UND 


PA411-3 




Pantoate, 


12 


0.19 


UND 


PA411-4 




Pantoate 


12 


UND 


UND 


PA412-1 


NDII 


Pantoate^ 


12 


UND 


UND 


PA412-2 




Pantoate, 


11 


UND 


UND 


PA412-3 




- Pantoate, 


13 


0.18 


UND 


PA412-4 




Pantoate 


12 


0.18 


UND 


PA413-1 


NDIII 


Pantoate^ 


12 


0.18 


UND 


PA413-2 




Pamoate, 


12 


0.17 


UND 


PA413-3 




Pantoate, 


12 


0.16 


UND 


PA413-4 




Pantoate 


12 


0.17 


UND 



UND: Below the limits of detection. 



15 
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Table 11. Pantothenate production ofPA410~PA413 in test tube cultures. 

i , . .p ,. .... 



Strain 


RBS 


Medium 
Supplements 


i 

i 

0D550 


1 ' 

Pan. 
g/l 


(3-AIa 
g/l 


PA221 


native 


Pantoate^ + Asp^ 


11: 

, < 

1 


0.3 


UND 


PA410-1' 
PA410-2 
PA410-3 
PA410-4 


* 

native 

t 


Pantoate^ + Asp J ^ 
Pantoate^ + ^ s P|q 
Pantoate^ + ^ s P]g 
Pantoate + Asp 


• 12 . 
12 
12 
12' 


f 

0.4 
, 0.4 , 
0.4 
0.4 


UND 
UND 
UND 
UND 


PA411-1 
PA411-2 
PA411-3 
PA411-4 ' 


NDI 


Pantoate^ + Aspj^ 
Pantoate^ + Asp^ 
Pantoate^ + Asp^ 
Pantoate + Asp 


13 
13 
13 
13 


1.7 
1.7' 
1.8 
0.4 


0.4 
0.4 
0.3 
UND 


PA412-1 
PA412-2 
PA412-3 
PA412-4 


NDII 


Pantoate^ + Aspj^ 
Pantoate^ + Asp^ 
Pantoate ^ + Asp^ 
Pantoate + Asp 


13 
12 
12 
12 


0.4 
0.4 
1.6 
1.5 


UND 
UND 
0.3 
0.2 


PA413-1 
PA413-2 
PA413-3 
PA413-4 


NDI II 


Pantoate^ + Aspj^ 
Pantoate^ + Asp^ 
Pantoate^ + Asp^ 
Pantoate + Asp 


13 
13 
13 
13 


1.6 
1.6 
1.7 
1.7 


0.3 
0.4 
0.4 
0.4 



UND: Below the limits of detection. 



5 As expected from previous experiments using PA221, none of the transformants 

that contained the native panD RBS produced measurable quantities of pantothenate 
when grown in medium supplemented with pantoate. However, nine of the twelve 
transformants expected to contain modified panD RBSs produced significant quantities 
of pantothenate (160-230 mg/1) under these conditions, indicating that they possess 

10 elevated levels of aspartate decarboxylase activity. When grown in medium 

supplemented with both pantoate and aspartate, these same nine transformants produced 
approximately four times more pantothenate than those with the native panD RBS. In 
addition, these nine transformants accumulated measurable quantities of P-alanine (230- 
4 1 0 mg/1). All transformants produced roughly equivalent quantities of pantothenate 

15 when grown in medium containing pantoate and P-alanine, demonstrating that each 
contains a functional pantothenate synthetase. 
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15 



20 



25 



30 



35 



, These data demonstrate that the synthetic panD RBSs are about, four times more 

effective than the native panD RBS in directing translation of the panD'gcne mRNA and 
evidence the utility of such synthetic RBSs in enhancing pantothenate production. 
Additional approaches to increasing pantothenate production can include, for example, 

i + 

increasing the half-life of the panD gene. mRNA, increasing the strength of the promoter 
for panD transcription and/or increasing the stability of the PanD protein. 



EXAMPLE VI: Construction of Strains Containing an Integrated P26panEl 

i 

Cassette without an Antibiotic Resistance Gene. 

J 

Example II describes the identification of the B. subtilis panEl gene that encodes 
the enzyme responsible for the majority of the ketopantoate reductase activity in B. 
subtilis. PA236 (containing the pAN236 plasmid) produced about twice as much 
pantothenate (2 g/1) as its parent strain, PA221 (1 g/1) in 24 hour SVY test tube cultures. 
PA236 was presumed to contain ah amplified (—3 copies) integrated pAN236 plasmid 
based on selection for tetracycline resistance (the tetR gene product being encoded on 
the pAN236 plasmid in addition to the P26 panEl cassette). Also useful in the 
methodologies of the present invention are strains that contain a single integrated 
unamplifiable copy of P26 panEl at the panEl locus, for example, without an antibiotic 
resistance gene in the strain. Such a strain was generated as follows. 

A plasmid named pAN251 was derived from pAN236 by inserting additional 
chromosomal sequences just upstream and just downstream from the P26 panEl cassette. 
These additional sequences, which provide homology to allow integration of the P26 
panEl cassette at panEl by double crossover, were obtained by PCR from chromosomal 
DNA as a template. pAN251 is shown in Figure 11. The nucleotide sequence of 
pAN251 is set forth as SEQ ID NO:8L 

Next, a strain was constructed which allowed selection for the incoming P26 
panEl cassette. The strain included the following three components: (1) P26 panBCD; 
(2) ApanEl\ and (3) /7vC", since both panEl and ilvC must be mutated to have a Pan" 
phenotype. The starting strain was CU550 {trpC2 9 UvC4, leuC124). The P26 panBCD 
cassette from PA221 chromosomal DNA was introduced in two steps to create strain 
PA290. Next, ApanEl::spec was transformed into PA290, using chromosomal DNA 
from strain PA240, to give strain PA294 (trpC2 9 ilvC4, leuC124, P 2 s panBCD, 
ApanEl::spec\ which is a strict pantothenate auxotroph. Finally, PA294 was 
transformed with plasmid pAN251, selecting for pantothenate prototrophy, to give strain 
PA303. This strain was expected to have the genotype trpC2 9 UvC4, leuC124 9 P26 
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panBCD, P26 panEl. PA303 was checked for the correct chromosomal structure at the 
panEl locus by PC& using primers that flank the P?6 insertion just upstream of panEL 
The PCR product from PA303 was of the expected size, with a concomitant loss of the 

j 4 

PCR product from the wild type panEl gene, consistent with having obtained the 
5 desired double crossover event. Furthermore, PA303 was tetracycline sensitive, which 
is also consistent with the desired double crossover event, as opposed to a Campbell- 
type single crossover of the plasmids into the chromosome. The irp, ilv 9 and leu 

i* ' ' ' ' ' ' 

auxotrophies from the parent strain were all maintained in PA303. 

In 24 hour liquid SVY test tube cultures, PA303 produced almost the same level 
10 of pantothenate as positive control PA236, and about twice as much as PA221 , which 
does not contain engineered panEl as , indicated in Table 12. 

Table 12. Pantothenate production by 24 hr. test tube cultures of PA303 and 

controls grown in SVY plus 5 g/l a-KIVand 5 g/l fi-alanine. 

1 , 1 

15 



Strain 


OD600 


[pan] g/l 


PA221-1, 


10.9 


0.85 


PA221-2 


10.5 


0.85 


PA236-1 


9.5 


1.74 


PA236-2 


9.3 


1.70 


P A3 03-1 


10.8 


1.66 


PA303-2 


10.7 


1.61 



EXAMPLE VII: Generation of Microorganisms Capable of Producing 
Pantothenate in an a-KIV (or Valine) Independent Manner 

20 a-ketoisovalerate (a-KIV) is a rate limiting intermediate for pantothenate 

production in certain strains deregulated for pantothenate synthesis. Addition of either 
a-KIV or valine at 5 g/l increases pantothenate production about 5-fold in test tube 
cultures with strains such as PA221. In order to alleviate the need to feed either a-KIV 
or valine, strains were engineered that have an increased capacity to synthesize a-KIV. 

25 a-KIV is produced in B. subtilis from pyruvate by the sequential action of three 

enzymes encoded by four genes, ilvB and ilvN 9 z7vC, and ilvD. In a wild type B. subtilis, 
three of the genes (ilvB 9 ilvN> and ilvC) are the first three genes of the large ilv-leu 
operon. The fourth gene necessary for a-KIV synthesis, i/vD s is located by itself 
elsewhere on the chromosome. The B. subtilis ilv-leu operon is thought to be regulated 
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only by leucine levels. Feeding of exogenous leucine reduces transcription of the ilv-leu 

1 * t * * 

operon by about 13-fold, probably by an attenuiation mechanism (Grandoni et al (1992) 
J. Bacteriol 174: 3212-3219). The only known feedback regulation in the ilv-leu 
pathway is the inhibition of the leuA gene product by leucine. 

5 As a first step to deregulate the synthesis of q-KIV, a copy of the ilvBNC region 

from the wild type B. subtilis ilv-leu operonwas isolated by PCR, and installed adjacent 
to the P26 promoter and RBS2 on a vector, pOLL8, that was designed to integrate a 
single P26 expression cassette by double recombination at the amyE locus. The amyE 
gene encodes a nonessential a-amylase, and is a useful locus for installing expression 
10 cassettes. The resulting plasmid, pAN267, is illustrated in Figure 12. The nucleotide 
sequence of pAN267 is set forth as SEQ ID NO:82. pAN267 readily gave stable 
transformants by double crossover at the amyE locus of B. subtilis strains, as described 
in detail below. 

Construction of pantothenate overproducing strains that are leucine prototrophs 

15 Initially, a B. subtilis strain containing ilvC4 and ApanEl was used to introduce a 

single copy of P26 ponEl into the chromosome without using an antibiotic resistance 
gene. The double mutant was required to select for the incoming P26 panEl cassette 
because a ApanEl mutation alone does not result in pantothenate auxotrophy. A strain 
named CU550 was obtained containing ilvC4 to be used as a basis for this type of strain 

20 construction. However, CU550 also contains a closely linked leuC124 mutation, so all 
strains derived from CU550 required leucine. Having shown that the combination of P26 
panBCD and P26 panEl was favorable for pantothenate production, the next step was to 
reassemble this combination of two cassettes in a leucine prototroph. 

Accordingly, the two cassettes were combined in two different strain 
25 backgrounds, RL-1 and PY79. To introduce chromosomal P 2 6 panEl into the PY79 and 
RL-1 strain backgrounds without using an antibiotic resistance gene, a strategy was used 
that did not rely on ilvC4. (The strategy took advantage of the observation that the 
ApanEl mutation causes a pantothenate bradytrophy, manifested by relatively small 
colonies on TBAB (rich) plates). First, ApanB::cat and ApanE::spec were introduced 
30 into both strain backgrounds. Next, the resulting strains were transformed 

simultaneously with DNA from two strains, PA221 (P 26 panBCD) and PA303 (P 2 e 
panEl), selecting for Pan + on TBAB plates. Colonies of two distinct sizes grew on the 
selective plates, with the larger size comprising about 2% of the colonies. The larger 
colonies were presumed to represented co-transformants that received both P26 panBCD 
35 and P26 panEl, and that the smaller colonies had received only P26 panBCD. Consistent 
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with this prediction, the larger colonies had lost both Cam r and Spec r , while the smaller 
colonies had lost only the cat gene, and retained the spec gene. Furthermore, a ' 
representative derivative of PY79 named PA327, and a representative derivative of RL- 

i 

1, named PA328, both produced the elevated levels of pantothenate in test tube cultures 

5 which was about 1.6 to 1.7 g/l (Table 13). 

1 ' , ' ' ' 

, * t 

Table 13. Pantothenate production ofPA327, PA328, and controls from 24 hr 

test tube cultures grown in SVYplus 5 g/l a-KIV and p-alanirie. 



Strain^ 


Background 


P26 ponEl copy 
number 


[pan] I g/l 


PA221-1 


RL-1 


0 


0.92 


PA221-2 


RL-1 

' i 


0 


0.95 


PA236-1 


RL-1 


amplified (~3) 


1.60 


PA236-2 


RL-1 | 


amplified (—3) 


1.73 


PA327- 1 


PY79 


1 


1 .66 


P A3 27-2 


PY79 


1 


1.65 


PA328-1 


RL-1 


1 


1.61 


PA328-2 


RL-1 


1 


1.91 



10 

Thus, PA327 and PA328 were concluded to contain both P 2 6 panBCD and P26 
panEl, and were used for further constructions as described below. PGR analysis 
confirmed the presence of the two cassettes. 

15 Installation of a stable P26 ilvBNC cassette into two lineages of pantothenate 
overproducing strains 

Having constructed PA327 and PA328, derivatives of PY79 and RL-1 that 
contain P26 panBCD and P26 panEI, and that are Leu + , the next step was to introduce 
stable copies of P26 ilvBNC. This was accomplished by transforming PA327 and 
20 PA328 with plasmid pAN267, selecting for Spec r . Screening by PCR showed that about 
85% of the obtained transformants contain P26 ilvBNC integrated at amyE by double 
crossover. One transformant of PA327, named PA340, and one transformant of PA328, 
named PA342, were chosen for further study. 

In test tube cultures grown in SVY medium plus 5 g/l |3-alanine but without 
25 added a-KIV, both PA340 and PA342 gave the expected increase in pantothenate 
production over that of PA327 and PA328, to about 1.3 to 2 g/l (Table 14). 
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Table 14. Pantothenate and valine production By PA340 and PA342, both 1 

containing P 2 6 ilvBNC in. 24 hr test tube cultures grown in SVYwith 5 g/l 
fi-alanine and with or without 5 g/l a-KIV 



Strain 

i 


Back- 
ground 


OD600 
- a-KIV 


* 

+ a-KIV 

1 


[pan] g/l 
-a-KIV 


+ a-KIV 


M] g/l 
- a-KIV 


+ a-KIV 


PA340-1 


PY79 


11.8 


7.1 




2.02 


2.10 


i , 

0.38 


1 

I 

0.90 


P A3 40-2 


PY79 


10.3 


7.5 


i 


1.97 


2.03 


0.40 


0.91 


PA342-1 


RL-I 


10.2 


8.0 


> 


1.29 


1.89 


0.27 


t 

0.78 


PA342-2 


RL-1 


9.6 


9.2 


! i 


1.34 


2.04 


0.21 


0.79 



The two new strains also g&ve a slight increase in valine secretion, indicating 
that the ilvBNC genes had been deregulated. However, when the same strains were 
grown with 5 g/l a-KIV added, a further increase in pantothenate production occurred 
10 from PA342, suggesting that a-KIV was still rate limiting in this strain background. 

Similar results, only with more growth and hence higher pantothenate levels, were seen 
in shake flask cultures (Table 15). 

Table 15. Pantothenate and valine production by PA340 and PA342 t both 
15 containing Y 2 (> ilvBNC in 24 hour shake flask cultures grown in SVYwith 

5 g/l fi-alanine and with or without 5 g/l a-KIV. 



Strain 


Back- 
ground 


OD600 
-a-KIV 


+ a-KIV 


[pan] g/l 
- a-KIV 


+ a-KIV 


[val] g/l 
- a-KIV 


+ a-K!V 


PA327 


PY79 


21 


22 


0.6 


3.0 


0.5 


1.3 


P A3 40-1 


PY79 


20 


20 


3.5 


4.1 


1.0 


1.9 


PA340-2 


PY79 


22 


19 


3.0 


2.1 


0.8 


1.4 


PA328 


RL-1 


20 


16 


1.4 


2.7 


0.6 


1.3 


PA342-1 


RL-1 


17 


16 


3.3 


3.6 


0.9 


1.6 


PA342-2 


RL-1 


18 


18 


3.1 


4.2 


0.8 


1.4 
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15 



20 



25 
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EXAMPLE VIII: Increasing /?a//Z> Copy Number in Strains Engineered t 

' I ■ - • . .i ..... 

Overproduce panEl and the ilvBNC ,Gene Products Enhances Pantothenate 

Production 1 ' ' ' . 1 

Experiments where p-alanine was fed to cultures of engineered B. subtilis strains 

consistently showed that ^-alanine was a rate limiting intermediate in pantothenate 

synthesis. The effect of adding additional copies pf panD on pantothenate production in 

PA340 and PA342 was examined. Strains PA340 and PA342 were transformed with 

chromosomal DNA isolated from PA401 with selection on plates containing 15 ng/ml of 

tetracycline (Tet 15 plates). Transformants derived from each parent were patched onto 

Tet 60 plates to identify those which were likely to, contain multiple copies of the 

expression cassette. Twelve transformants from each transformation which grew on 

Tet 60 were streaked for single colonies on this medium and then assayed in SVY 

medium test tube cultures for pantothenate production. One transformant from each 

group was found to produce greater than 300 mg/1 pantothenate in 24 hours. These two 

transformants were saved and named PA404 (PA340 strain background) and PA405 

(PA342 strain background). Both strains were resistant to spectinomycin, indicating that 

the P26 ilvBNC expression cassette was still present at amyE. PCR analysis of 

chromosomal DNA isolated from each strain confirmed that the deregulated panEl gene 

had also been retained. 

Next, PA404 and PA405 were evaluated in shake flask cultures which were 
grown in SVY medium containing maltose as the carbon source and supplemented with 
various intermediates. The cultures were grown for 24 and 48 hours and then assayed 
for pantothenate, p-alanine, and valine production. The results of this experiment are 
presented in Table 16. Analogous shake flask culture data for the parent strains (PA340 
and PA342) are included in the tables for comparison. 



Table 16, 



Pantothenate production by PA 4 04 and PA405 in shake flask cultures 
after 24 hours 



Strain 


Medium 
Supplements 


OD600 


Pan 

g/l 


p-AIa 

g/l 


Val 

g/l 


PA340 


none 


20 


0.4 


<0.1 


1.0 


PA404 


none 


22 


1.8 


<0.1 


0.7 


PA342 


none 


19 


0.3 


0.2 


0.7 


PA405 


none 


19 


1.4 


0.4 


0.5 















V 
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PA342* 
PA405* 
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P-alanine 
B-'alanine 

5 

P-alanine 
P-alanine^ 



18 


3.6 


3.2 


0.6 


18 


2.8 


5,1 


0.7 


17 


3.3 1 


3,3 


, 05 


19 


1.3 


6.5 


0.6 



Values are the average of .duplicate flasks except where indicated by *. 



In the absence of any medium supplementation, PA404 and PA405 made four to 
five times more pantothenate in 24 hours compared to their isogenic parent strains 
(Table 1 6). The supply of P-alanine was clearly limiting in the parent strains PA340 and 
PA342. Addition of amplified P26 panD greatly increased the supply of P-alanine. 



EXAMPLE IX: Deregulation of the B. subtilis UvD Gene Enhances Pantothenate 
Production 

10 To deregulate expression of the UvD gene, standard procedures (described ; 

i 

above) were used to integrate the constitutive P26 promoter and an artificial ribosome 
binding site, RBS2, just upstream of the UvD coding region. The UvD gene maps by 
itself, unlinked to the ilvBNC operon. First, a 2.4 kb region of the RL-1 chromosome 
that contains the UvD coding region and 730 bp of upstream sequence was cloned by 
15 PCR into a low copy (about 15 per E. coli cell) vector called pOK12, to give plasmid 
pAN257, shown in Figure 13. 

Taking advantage of a natural EcoRl site just upstream of the native UvD gene 
promoter, and a natural Ncol site at the UvD start codon, an artificial sequence 
containing P26 and RBS2 was inserted into pAN257 to give pAN263 (Figure 14). The 

20 nucleotide sequence of pAN263 is set forth as SEQ ID NO:83. In parallel with this 
construction, the cat gene was also inserted into pAN257, between the same upstream 
EcoRl site and a BgUl site in the middle of the UvD coding region, to give pAN261, 
which is deleted for a large portion of the UvD gene (Figure 15). 
Using pAN261 and pAN263, the P26 UvD cassette could then be installed in the B. 

25 subtilis chromosome in two steps. In the first step, pAN261 is introduced by 

transformation, selecting for chloramphenicol resistance, and then confirming an Ilv" 
phenotype. In the second step, pAN263 is introduced, selecting for Ilv + , checking for 
chloramphenicol sensitivity, and confirming correct local structure by PCR. 

pAN261 was first transformed into strain RL-1 (highly competent) to give strain 
30 PA343 (AilvD::caf) y and then chromosomal DNA from PA343 was used to transform 
PA340 and PA342 to Ilv* auxotrophy, yielding strains named PA348 and PA349, 
respectively. Chromosomal DNA is inherently more efficient than monomelic plasmid 
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in transforming B. subtilis. Similarly, pAN263 DNA was transformed into PA343 
(moderately competent) to give strain PA345 (P26 #vZ)), and then PA345 chromosomal 
DNA was used to transform PA348 and PA349 to Ilv + prptotrophy, yielding strains 

1 

PA374 and PA354, respectively, 

5 As predicted, PA374 and PA354 gave further increases in pantothenate 

production, to about 2.5 to 2.9 g/1. in test tube cultures grown in SVY plus 5 g/1 B- 
alanine (Table J 7). 

Table J 7. Pantothenate and valine production by PA374 and PA354, containing P 2 6 
10 ilvD, and controls, in 24 hr test tube cultures grown in SVtfwith 5 g/l fi- 

alanine and with or without 5 g/l a-KIV. 









1 


OD 60 0 


[pan] 
g/l 


[val] 
g/l 


Strain ' 




Back- 
ground 


ilvD status 


a-KIV 

1 1 


+ 


a-KIV 


+ 


a-K.IV 


+ , 


PA340 


• 


PY79 


w.t. 


T 

9.2 


9.0 


2.14 


2.23 


0.38 


0.90 


PA348 




PY79 


ilvD::cat 


11.7 


10.0 


0.19 


2.23 


0.19 


0.91 


PA374- 
PA374- 


■1 
•2 


PY79 
PY79 


P26 HvD 
P26 Hv£> 


9.1 
8.2 


7.3 
7.7 


2.93 
2.99 


2.40 
2.36 


0.58 
0.60 


0.87 
0.95 


PA342 




RL-1 


w.t. 


10.2 


8.0 


1.29 


1.89 


0.27 


0.78 


PA349 




RL-1 


ilvD::cat 


8.1 


7.7 


0.17 


1.87 


0.22 


0.88 


PA354- 
PA354- 


• 1 
•2 


RL-1 
RL-1 


P26 i?v£> 
P26 ilvD 


9.6 
7.5 


9.6 
8.2 


2.57 
2.48 


2.03 
2.24 


0.65 
0.64 


1.23 
0.97 



In the absence of added P-alanine, strains PA374 and PA354 produced only 
15 about 0.2 g/1 pantothenate in test tube cultures, indicating that PanD activity is 
significantly rate limiting. 

To alleviate this limitation, the amplifiable P26 panD cassette from strain PA401 
was installed. PA401 chromosomal DNA was transformed into PA374 and PA354, 
selecting for Tet r at 15 mg/1, to yield strains PA377 and PA365, respectively. After 
20 transformants were obtained, the strains were streaked on plates containing 30 and 60 
mg/1 tetracycline to reamplify the copy number of the P26 panD cassette integrated at the 
bpr locus. In test tube cultures grown in SVY without a-KIV or P-alanine, a substantial 
improvement in pantothenate titers over those of P A3 74 and PA354 was obtained 
(Tables 18 and 19). 
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Table 18. Pantothenate production by PA 3 65, containing ' amplified P 2 6panD, and 

controls, in 24 and 36 hr test tube cultures grown in SVY-glucose without 
p-alanine or a-KTV. 







OD 60 o 


i 

[pan] g/1 




Relevant genotype 

i 










Strain 


24 hrs. 


36 hrs 


24 hrs; ' , 


36 hrs. , 


PA342-1-,1 , 


w.t. ilvD 


11.7 1 . 


i 

* i 

8.8 


b.d. 


0.27 


PA342-J-2 


w.t. /7vZ> 

i 


12.8 


8.8 


b.d. 


0.26 


PA354-1-1 


P26 HvD 


n.d. 


,11.0 


i f 

n.d. 


0.19 


PA354-1-2 


P26 UvD 

i 


n.d. 

i 


8.4 


n.d. 


0.20 


PA365-1 


P26 HvP>> P26 panD 


9.8 


10.0 


1.01 


2.07 


PA365-2 

• i 


P.26 ilvD* P26 panD 


9.9 


10.4 


0.96 


2.09 



5 , , 

n.d. = not determined; b.d. = below detection 



Table 19. Pantothenate production by PA 3 77, containing amplified P 2 6 panD, and 

controls, in 27 hr test tube cultures grown in SVY-glucose or SVY- 
10 maltose, without a-KIV, and with or without fi-alanine. 







OD 600 


Strain 


Relevant genotype 


- P-ala 
Glucose 


+ P-ala 
Glucose 


- p-ala 
Maltose 


+ p-ala 
Maltose 


P A3 74-1 
PA374-2 


P26 ilvD 
P26 UvD 


9.4 
9.2 


9.8 
9.6 


7.0 
6.6 


6.4 
6.3 


P A3 77-1 
P A3 77-2 


?26 HvD, P26 panD 
P26 ilvD, P26panD 


10.0 
10.5 

[pan] g/1 


7.6 
7.8 


7.2 
9.4 


6.1 
5.4 


Strain 


Relevant genotype 


- p-ala 
Glucose 


+ p-ala 
Glucose 


- p-ala 
Maltose 


+ p-ala 
Maltose 


PA374-1 
PA374-2 


P26 ilvD 
P26 ilvD 


0.04 
0.10 


2.76 
2.65 


0.14 
0.15 


1.31 
1.33 


PA377-1 
PA377-2 


P26 ilvD, P26 panD 
P26 HvD, P 26 panD 


1.25 
1.25 


2.76 
2.35 


1.26 
1.31 


1.10 
1.26 



In SVY with glucose, an increase in pantothenate production can still be 
15 achieved by feeding 5 g/1 p-alanine suggesting that increasing panD expression further 
might increase pantothenate production. In SVY with maltose, no further increase in 
pantothenate was obtained by feeding p-alanine suggesting that p-alanine and/or 
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aspartate synthesis is suppressed by glucose. Strains PA377 and PA365 have been 
evaluated in 1 0 liter fermentors, where they typically produce above 20 g/l pantothenate 
in 48 hours without supplemental ^-alanine and a-KIV br, valine, described hi detail 
below. 

5 ' ! 

EXAMPLE X: 10 liter Fermentations of Pantothehate-Producing Microbes 

1 , ■ 1 ■ , 

Engineering of the P26 UvBNC and P26 HvD cassettes to give strains PA342 and 
1 PA354 allowed the production of 22 and 26 g/l of pantothenate, respectively, without 

the addition of valine or a-KIV to the fermentation medium (Table 20). At 48 hours, 

I 

10 both strains had secreted about 0.5 g/l of valine into the medium. 

! 

i 

Table 20. J 0-liter fermentations of Jive pantothenate overproducing strains. 



Strain 


Medium 


Feed 
40% Glucose 
plus 


1 

1 

• OD 600 
48 hr 


Valine 
48 hours 

g/l 


P-ala 
48 hr 

g/l 


Pantothenate 

g/L : 

36 hr 48 hr 72 hr 


PA 236 


SVYG 


50 g/l p-ala 
25 g/l a-KIV 


108 


added 


. added 


16 


19 


21 


PA 342 


SVYG 


50 g/l p-ala 


92 


0.5 


added 


17 


22 




PA 354 


SVYG 


50 g/l P-ala 


90 


0.5 


added 


19 


26 




PA 365 


SVYG 


25g/l YE 


77 


0.85 


0.4 


18 


21 


27 


PA 377 


SVYG 


25g/l YE 


85 


1.5 


0.5 


18 


22 


31 


PA 377 


PFMG 


25g/l YE 


96 


0.8 


0.4 


19 


25 


29 


PA377 


PFMG 




71 


0.7 


0.1 


16 


21 





15 Pantothenate synthesis in fermentors 

With the addition of the P26 panD cassette to strains PA354 and PA374 to create 
strains PA365 and PA377, neither {3-aIanine nor a-KIV needed to be added to the 
fermentors. Strain PA365 produced 21 g/l pantothenate in 48 hours and 27 g/l in 72 
hours with no precursors added to the medium (Table 20). PA377 was somewhat better, 
20 producing 18 g/l of pantothenate in 36 hours, 22 g/l in 48 hours, and 31 g/l in 72 hours). 
Valine was measured at 0.85 and 1.5 g/l for strains PA365 and PA377, respectively, at 
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48 hours in SVYG medium. Strain PA377 maintained valine between 1-1.5 g/1 
throughout most of the fermentation and P-alanine between 0.2 and 0.5 jg/L 

i . i 

Strain PA377 was further evaluated in 10-liter fermentors in yeast extract based 
PFMG medium. Pantothenate yields in PFMG and SVYG medium were similar. In 
5 PFMG, PA377 produced 19 g/1 of pantothenate in 36 hours, 25 g/1 in 48 hours, and 29 
g/1 in 72 hours. In SVYG, PA377 produced 18 g/L pantothenate in 36 hours,, 22 g/L in 
48 hours and 3 1 g/L in 72 hours (Table 20). 

• ' 1 i ' 

j ' 

EXAMPLE XI: Converting Strain PA377 to a Tryptophan Prototroph 

10 

PA377 (Trp~) was transformed to Trp + using chromosomal DNA from PY79 to 
give strain PA824. After re-amplification of the P26pcinD casette, PA 824 was compared 
to PA377 for pantothenate production in lest tube cultures grown in S VY glucose with 
or without 5 g/L P-alanine (Table 2l). , 

15 

Table 21 : Trp* derivatives of PA377: Pantothenate production in 48 hour test tube 

cultures grown in SVY glucose, ±p-alanine 



Strain 


trpC donor 


OD 60 o 


[pan] g/L 


- p-alanine 


+ P-alanine 


- p-alanine 


+ p-alanine 


PA377-1 


RL-1 


8 


8 


1.5 


3.4 


PA377-2 


RL-1 


8 


9 


1.6 


3.6 


PA824-1 


PY79 


12 


10 


0.7 


3.7 


PA824-2 


PY79 


11 


11 


1.9 


4.9 



20 The Trp+ strains grew to slightly higher densities than PA377. In the absence of 

exogenous P-alanine, all of the strains produced similar levels of pantothenate, while 
with the addition of p-alanine 5 the Trp+ derivatives produced somewhat more 
pantothenate. 

25 Ferment or studies with PA824 

PA824 was evaluated in CF3000 Chemap 14 liter vessels with 10 liter working 
volumes. Formulations for two of the media used in the fermentors are given in Tables 
22 and 23. 
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Table 22 : 





MATERIAL 


g/L (final ID 


1 


Amberex 1003 


10 


2 


Na Glutamate 


. 5 1 


3 


(NH 4 ) 2 S0 4 , , 


! * 1 

8 


* 


• * 


1 

, 1 


4 


MAZUDF37C 


2.5 




i 




Added After Sterilization and Coo 


jjown 




i 




1' 1 


KH 2 P0 4 


1 in 


2 


K 2 HP0 4 -3H 2 O 


on 








1 


Glucose 




2 


MgCl 2 -6H 2 0 


1 


3 


CaCl 2 -2H 2 0 


0.1 








1 


Sodium Citrate 


1 


2 


FeS0 4 -7H 2 0 


0.01 


3 


SM-1000X 


1.0 ml 










H 2 Q 


qs to 6000 ml 


FEED 




MATERIAL. 


g/L 


1 


Glucose 


600 


2 


CaCl 2 -2H 2 0 


0.6 










H 2 0 


qs to 3000 ml 
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Formulation for PFMG-5 medium 



BATCH 
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Table 23 : Formulationjor SVY-4 medium 



BATCH 





MATERIAL 


g/L (final []) 


1 


Veal Infusion 

■ ' 


• ■ ' 25 


2 


Yeast Extract 


<t . ■ ■ ■ i 

. 5 1 


, 3 ., , 


Na Glutamate i 


■ 5 


4. 


(NH 4 ) 2 S0 4 


4 




> 




5 


MAZU DF 37C 

1 * 


2 5 




\ 

1 

i 




Added After Sterilization and Coo 


Down 




■ 


1 


1 


KH 2 P0 4 


10 


2 


K 2 HP0 4 -3H 2 0 


20 








1 


Glucose ' 


20 


2 


MgCl 2 -6H 2 0 


1 


3 


CaCl 2 -2H 2 0 


0.1 

1 








1 


Sodium Citrate 


1 


2 


FeS0 4 -7H 2 0 


0.01 


3 


SM-1000X 


1.0 ml 










H 2 Q 


qs to 6000 ml 


FEED 




MATERIAL 


g/L 


1 


Glucose 


600 


2 


CaCl 2 -2H 2 0 


0.6 










H 2 Q 


qs to 3000 ml 



All fermentations were glucose limited fed batch processes. Immediately after 
inoculation, agitation was set at 200 rpm. The initial batched 2% glucose was consumed 
10 during exponential growth. Afterwards, glucose concentrations were maintained 
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between 0.2 and 1 .0 g/L by continuous feeding of a 60% glucose solution, The variable , 

rate feed pump was computer controlled and linked to the dissolved oxygejn 

f , t 

concentration [pCh] in the tank by an algorithm. When the [p02]fell to 30%, computer 
control began to automatically adjust the agitation rate to maintain a dissolved oxygen 
5 concentration between 25 and 30% [p02]. Computer control and data recording were by 

Braun MFCS software. 

< . | 

■ In one study, PA284 was grown in fermentors at two temperatures (40°C and , 

I ■ . 

43 °C) in the medium described in: Table 22. Results of two experiments demonstrated 

i 1 I • 

that the highest pantothenate titers at early time points were produced at 43°C. The cell 
10 mass approached 150 optical density units at ODeoo and 56 hours at 43 °C, and the 

pantothenate titers were 21 g/L, 28 g/L and 36 g/L at 36, 48 and 72 hours respectively. 
In the parallel fermentation at 4Q°C, the cell mass approached 120 optical density units 

4- 

at ODeoo and 56 hours, and the pantothenate titers were 18 g/L, 26 g/L and 37 g/L at 36, 
48 and 72 hours, respectively. | 

15 In another study, PA824 was grown in a fermentor at 43°C in the medium i 

described in Table 23. The cell mass exceeded 160 optical density units at OD600 and 36 
hours, and the pantothenate titers were 23 g/L, 34 g/L, 37 g/L and 40 g/L at 24, 36, 48 
and 60 hours, respectively. In other fermentations, increasing the amount of trace 
elements in the glucose feed (e.g., increasing the concentration of SM from IX to 2X) 

20 resulted in even higher titers of pantothenate. 

EXAMPLE XII: Identification and characterization of the B. subtilis coaA gene 
product 

The annotated version of the B. subtilis genome sequence available on the 
25 "Subtilist" web site contains no gene labeled as coaA. However a homology search 
using the protein sequence of E. coli pantothenate kinase as a query sequence gave a 
good match with B. subtilis geneyq/S, which is annotated as "unknown; similar to 
pantothenate kinase." This gene appears to be the penultimate gene in an operon 
containing five open reading frames (Figure 18). Two of the open reading frames 
30 encode proteins which are similar to D-serine dehydratase and to "ketoacyl reductase"; 
the other two have no known homologies. For the open reading frame corresponding to 
coaA, there are three possible start codons; each having a possible ribosome-binding site 
(RBS) associated with it. The three potential coaA ORFs were named coaA J t coaA2, 
and coaA3, from longest to shortest. 
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, All three potential coaA open reading frames were cloned along, with their 

respective RBSs by PCR followed by ligation into expression plasmid pAN229. 
pAN229 is a low copy vector in E. coli that provides expression from the SP01 phage 
P/s promoter and can integrate by, single crossover at bpr with tetracycline selection. A 
5 representative resulting plasmid, pAN281, is shown in Figure 19: 

I 

> ' I 

To determine if the cloned; putative coaA ORFs actually encode a pantothenate 
kinase activity, several isolates of jail three plasmids were transformed into the E. coli ' 
■ strain YH1, that contains the coaA15(Ts) allele. Transformants were streaked to plates 
incubated at 30° and 43°C to test for complementation of the temperature sensitive 
10 allele. All isolates of all three cociA variants, except for one isolate of pAN282, 

* i 

complemented well at 43 °C, indicating that all three plasmid constructs encode an active 
pantothenate kinase. Accordingly, it can be concluded that the B. subtilisyqjS open 

* ■ * 

reading frame codes for an active pantothenate kinase. 

! ■ 

15 EXAMPLE XIII: Deletion of the coaA gene from the B. subtilis genome ! 

The coaA gene of B. subtilis (yq/S) was deleted from the chromosome of a B. 
subtilis strain by conventional means. The majority of the coaA coding sequence was 
deleted from a plasmid clone and replaced by a chloramphenicol resistance gene (cat), 
while leaving approximately 1 kb of upstream and downstream sequence to allow 

20 homologous recombination within the chromosome, to give plasmid pAN296 (see 

Figure 17). pAN296 was then used to transform a B. subtilis strain (PY79), selecting for 
chloramphenicol resistance. The majority of transformants result from a double 
crossover event that effectively substitutes the cat gene for the coaA gene. The 
transformed strain containing the coaA deletion - cat insertion grew normally due the 

25 presence of a second B. subtilis pantothenate kinase encoding gene described herein. 

EXAMPLE XIV: Identification and characterization of a second B. subtilis gene 
encoding pantothenate kinase activity 

As described in detail in the instant specification, in order to maximize 
30 pantothenate production, it is necessary to restrict the flow of pantothenate toward 
Coenzyme A (CoA), for example, by reducing the activity of pantothenate kinase, the 
first enzyme in the pathway from pantothenate to CoA. After finding that deletion of 
the coaA gene from the chromosome of B. subtilis is not a lethal event (see Example 
XIII), it was concluded that B. subtilis must contain a second gene that encodes an active 
35 pantothenate kinase, since pantothenate kinase is an essential enzyme activity. 
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A second pantothenate kinase-encoding gene was identified by complementing 
the E. coli strain YH1 (coaAI5(Ts)) with a A subtilis gene bank and selecting for 
transformants that were able to grow at 43°C. Found among the transformants were two 
families of plasmids that had overlapping restriction maps within each family, but not 
5 between the families. As expected, the restriction map of one family was identical to 
that predicted from the B. subtilis genome sequence for the homologue of the E. coli 

coaA gene (which we named coaA also, see above) and surrounding sequences. The 

i * i . 

other family had a restriction map that was completely non-overlapping with the first. 

DNA sequencing of the ends of the cloned , inserts from the second family 
10 showed that the clones came from a region of the B. subtilis chromosome that includes 
the 3' end of the ftsH gene, the 5' end of the sul gene, and all of the yacB, yacQ yacD, 
cysK, pabB 9 pabA and pabC genes. None of the open reading frames of these cloned 
inserts showed horhology to any known pantothenate kinase sequences, either 
prokaryotic or eukaryotic. 

15 Several deletions were created through the B, subtilis genomic sequences in the 

cloned inserts. Each deletion was tested for complementation of the E. coli temperature 
sensitive pantothenate kinase. In particular, a deletion that removed all DNA between a 
Stu I site in the cloning vector and a Swa I site in the yacC gene, leaves yacB as the only 
intact open reading frame in the cloned insert (see Figure 21). This deleted plasmid still 

20 complemented the E. coli pantothenate kinase mutant. However, another deletion that 
removed DNA from the Swa I site in yacC through a Bst\ 1071 site in the (already 
truncated) ftsH gene, could not complement the E. coli pantothenate kinase mutant. 
From these results, it was concluded that the yacB open reading frame was responsible 
for the complementation activity. To confirm that yacB is a pantothenate kinase gene, 

25 XhzyacB ORF plus 112 base pairs of downstream flanking sequence was amplified by 
PCR in two independent reactions and cloned downstream of a constitutive promote to 
give plasmids pAN341 and pAN342 (Figure 22). Both pAN341 and pAN342 
complemented the defect in YH1 at 44°C, while a control plasmid, which has the same 
backbone, but expresses panBCD instead of yacB did not. This confirmed that the yacB 

30 open reading frame was responsible for the complementation of YH1. 

As such, a novel gene that encodes pantothenate kinase activity in B. subtilis has 
been discovered that is not related by homology to any previously known pantothenate 
kinase gene. This gene has been renamed coaX, as a second, alternative gene that 
encodes an enzyme that catalyzes the first step in the pathway from pantothenate to 
35 CoaA. Deletion of coaX by methods described above for deleting coaA, in conjunction 
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with reduction in the activity of the CoaA enzyme, provides a means to, reduce, 

pantothenate kinase activity to the desired level. - 1 , 

i » 

Several homologues of the B. subtilis coaX gene were identified by homology 
searching of various publically available databases using the published yacB (coaX) 
5 open reading frame sequence and predicted amino acid sequence (as set forth in SEQ ID 
NOs:84 and 85 respectively). In two cases (Mycobacterium tuberculosis and 
Streptomyces coelicolor) the homologous coaX genes are adjacent to, or almost adjacent 
. to, pantothenate biosynthetic gene's, consistent with these homologs having a role in 
pantothenate metabolism. The CoaX proteins show no homology to the CoaA family of 
10 pantothenate kinases, nor to the eiikaryotic family of pantothenate kinases exemplified 
by PanK of Saccharomyces cerevisiae. 

i * 

Alignment of the amino acid sequences of several bacterial CoaX homologs with 
the amino acid sequence predicted from translating the B. subtilis yacB ORF described 
in the published B. subtilis genome sequence revealed that the CoaX proteins from other 

i 

15 bacteria contained additional amino acid residues at their carboxy-terminal ends. 
Moreover, these extensions beyond the end of the predicted amino acid for the B. 
subtilis gene product contained two relatively well conserved segments of sequence. 

Translation of nucleotide sequences just downstream from the stop codon of the 
B. subtilis yacB ORF in a different reading frame revealed the existence of amino acid 
20 sequences very similar to the carboxy-terminal extensions of the other bacterial CoaX 
proteins. It is thus believed that an error exists in the published DNA sequence of the B. 
subtilis yacB ORF sequence that causes a frame shift leading to an artifactual 
downstream amino acid sequence and premature termination. 

The PCR-generated sequences of B. subtilis CoaX in pAN341 and pAN342 
25 (described above) contain enough downstream flanking sequence to encode the putative 
carboxy-terminal extension described above, which is consistent with the result that the 
clones were functional in the complementation assay. However when the 3' PCR 
primer was positioned to include only the shorter yacB ORF predicted from the 
published sequence, but not to include the putative carboxy-terminal extension, then the 
30 resulting plasmids, pAN329 and pAN330 (similar in structure to pAN341 and pAN342; 
see Figure 22), did not complement the defect in YH1 . This result supports the notion 
that the published yacB coding sequence contains a frame-shift error, and that the 
carboxy-terminal end of CoaX is necessary for pantothenate kinase activity. The 
predicted correct nucleotide sequence for B. subtilis coaXxs set forth as SEQ ID NO: 19 
35 and the translated amino acid sequence is set forth as SEQ ID NO:9. A multiple 
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EXAMPLE XVIII: Generation of mutant coaX genes encoding pantothenate 
kinase having reduced or temperature sensitive activities ' 

Mutant coaX genes are generated by introducing point mutations into the gene 
and testing the resulting mutants for the ability to complement the E. coli YH1 strain as 
5 described in Example XII. Preferred mutations in the coaX gene seqeunces are those 
that encode a substitution of a residue conserved among CoaX sequences from a variety 
of bacterial sources (e.g., a conserved residue set forth in Figure 23); Alternatively, 
random mutations in the coaXgene sequence are generated by mutagenic PCR and in 
vitro recombination and identified by screening for alleles that poorly complement the 
10 E. coli coaA 1 5(Ts} mutant. 

i i ■ 

Mutants so generated (i.e., mutants having reduced coaX activity) can be further 
engineered such that the endogenous coaA gene is deleted (as described in Example 
XIII). CoaX reduced-activity mutants can also be further engineered to contain reduced- 
activity CoaA gene products as described in Example XV. 

15 

EXAMPLE XIX: Enhanced Production of Panto-Compounds Using Bacteria 
Having Deletions in One or More Pantothenate Biosynthetic Enzymes 

If the desired panto-compound is not pantothenate, then an appropriate deletion 

i ■ 

20 of one or more of the pantothenate biosynthetic genes from a pantothenate 

overproducing strain will provide a strain that produces said desired panto-compound. 
In this example, the desired panto-compound is pantoate. Starting with, for example, 
strain PA236, PA3 13 or PA824 either one or both of the panC and pariD genes is 
deleted. In another example, ketopantoate is the desired panto-compound. Starting 

25 with, for example strain PA244, PA245 or PA824 one, two or all of the HvC y panEl, 
panC and pariD genes are deleted from the starting strain. If P-alanine is the desired 
panto-compound, then panB and panC can be deleted, preferably in a fashion that leaves 
an in frame fusion of a small portion of the 5' end of pariB with a small portion of the 
3' end ofpanC, from the strain PA221, PA235, PA245, or PA313. In all of the above- 

30 mentioned examples, the panto-compound producing strain will be a pantothenate 
auxotroph. Accordingly, the growth medium requires sufficient pantothenate for 
adequate growth. Vectors designed to overexpress panD as described above are then 
transformed into the above strains to further enhance p-alanine production. 

The above-mentioned deletions are accomplished by methods well-known to 

35 those skilled in the art, for example, by insertion of an antibiotic resistance gene and 
removing sufficient sequence from the target gene(s) to inactivate said target gene(s). 



WO 01/21772 



PCT/US00/25993 



- 100- 



Alternatively, removal of targeted sequences is accomplished without simultaneous 
introduction of an antibiotic resistance' gene in said target gene and then introduced by 
congression (co-transformation with any other appropriate selectable DNA sequence) 

followed by screening for the loss of function of said target gene by replica plating. 

i . • , * 

Table 24 : Strains (and corresponding phenotypes) for panto-compound production 



Name 


Pheno 


Drug 


1 panBCD 






I 




PA221 


Trp- 




P26panBCD 


PA222 






P /s panBCD 


PA235 






P26panBCD 


PA236 






P 26 panBCD 


PA327 


Trp- 


• 


P26panBCD 


PA328 


M 1 1 t 

* 1 r 




P26panBCD 




irp- 


opc 

1 


P Ofinsm /?/""*/") 

r zopunijK-, lj 


PA342 


Trp- 


Spc 


P26panBCD 


PA354 


Trp- 


Spc 


P26panBCD 


PA365 


Trp- 


Spc, 


P26panBCD 






Tet 




PA374 


Trp- 


Spc 


P26panBCD 


PA377 


Trp- 


Spc, 


P26panBCD 






Tet 




PA40I 


Trp- 




P26panBCD 


PA402 


Trp- 




P26panBCD 


PA403 


Trp- 




P26panBCD 


PA404 


Trp- 


Spc, 


P26panBCD 






Tet 




PA405 


Trp- 


Spc, 


P26panBCD 






Tet 




PA651 


Trp- 


Spc 


P26panBC*D 


PA284 




Spc, 


P26 panBCD 



panE locus 



itvD : amyE locus 
locus 



P26 panE! 
P26pariEJ 
P26panEi 
P26panEl 
P26panEl 



P26ilvBNC 
P26ilvBNC 



P26panEl P26ihD P26ilvBNC 

P26panEl P26ilvD P26iIvBNC 

P26panEl P26itvD P26ilvBNC 

P26panEl P26ilvD P26UvBNC 



P26panEl 
P26panEJ 



P26ilvBNC 
P26ilvBNC 



bpr locus Parent 



RJL-1 

PA221 
PA221 
PA235 
PA327 
PA328 
PA342 
PA354 



P26panD423 

P26panD423 

P26panD423 
P26panD428 
P26panD429 
P26panD423 



PA340 
PA374 

PA221 
PA221 
PA22I 
PA340 



P26panD423 PA342 



P26panD423 



PA374 
PA377 



Tet 



Equivalents Those skilled in the art will recognize, or be able to ascertain using no 
10 more than routine experimentation, many equivalents to the specific embodiments of the 
invention described herein. Such equivalents are intended to be encompassed by the 
following claims. 
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What is claimed: 
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I 

1 . A method of producing a panto-compound comprising culturing a 
microorganism which overexpresses at least one Bacillus pantothenate biosynthetic 

5 enzyme under conditions such that the panto-compound is produced. 

1 ' 1 

2. The method of claim 1 , wherein the microorganism overexpresses at least 

1 1 • ■ 1 

one Bacillus subtil is pantothenate biosynthetic enzyme. 

1 ' 1 I ■' ' ' ■ 

! i 

'■ I r i 

10 3. The method of claim 1 or 2, wherein the pantothenate biosynthetic 

enzyme is selected from the group consisting of ketopantoate hydroxymethyltransferase, 
pantothenate synthetase, aspartate-a-decarboxylase and ketopantoate reductase. 

i 

i 

4. The method of any: one of claims 1 to 3, wherein the microorganism 
1 5 overexpresses at least two pantothenate biosynthetic enzymes. 

i 

5. The method of any one of claims 1 to 3, wherein the microorganism 
overexpresses at least three pantothenate biosynthetic enzymes. 

20 6. The method of any one of claims 1 to 5, wherein the panto-compound is 

selected from the group consisting of pantothenate, pantoate, ketopantoate and P- 
alanine. 

7. A method of producing a panto-compound comprising culturing a 

25 ketopantoate reductase-overexpressing (KPAR-O) microorganism under conditions such 
that the panto-compound is produced. 

8. The method of claim 7, wherein the panto-compound is pantothenate or 
pantoate. 

30 

9. The method of claim 7 or 8, wherein the ketopantoate reductase is 
bacterial-derived. 



10. The method of claim 7 or 8, wherein the ketopantoate reductase is 
35 derived from Bacillus. 
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11. The method of claim 7 or 8, wherein the ketopantoate reductase is 
derived from Bacillus subtilis. 1 1 • 

• ■ i 

i * . , , i 

i ■ ■ 

) * 

12. The method of any one of claims 7 to 11, wherein the KPAR-O 

5 microorganism further overexpresses at least one pantothenate biosynthetic enzyme in 
addition to overexpressing ketopantoate reductase,' ' 

13. The method of claim 12, wherein the KPAR-O microorganism further 
overexpresses at least one of ketopantoate hydroxymethyltransferase, pantothenate 

10 synthetase and aspartate-a-decarboxylase. 



i 



14. A method of producing pantothenate in a manner independent of 

i 

precursor feed comprising culturing an aspartate-a-decarboxylase-overexpressing (AaD- 
O) microorganism having a deregulated isoleucine-valine (ilv) pathway under conditions 
1 5 such that pantothenate is produced. 



v 



1 5. A method of producing at least 2 g/L pantothenate in a manner 
independent of aspartate or P-alanine feed comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that 

20 pantothenate is produced. 

16. A method of producing at least 2 g/L pantothenate in a manner 
independent of valine or a-ketoisovalerate feed comprising culturing a microorganism 
having a deregulated isoleucine-valine (/7v) biosynthetic pathway under conditions such 

25 that pantothenate is produced. 

17. A method of producing at least 30 g/L pantothenate in a manner 
independent of aspartate or P-alanine feed comprising culturing an aspartate-a- 
decarboxylase-overexpressing (AaD-O) microorganism under conditions such that 

30 pantothenate is produced. 

1 8. A method of producing at least 30 g/L pantothenate in a manner 
independent of valine or a-ketoisovalerate feed comprising culturing a microorganism 
having a deregulated isoleucine-valine (/7v) biosynthetic pathway under conditions such 

35 that pantothenate is produced. 
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19. A p-alanine independent high yield production method for producing 
pantothenate comprising culturing a manipulated microorganism under condition^ such 
that pantothenate is produced at a significantly high yield., . ' . 

i 

5 20. The method of any one of claims 14 to 19, wherein the microorganism 

overexpresses acetohydroxyacid synthetase or is transformed with a vector comprising 
an ilvBN nucleic acid sequence or Ian qlsS sequence \. , , ' ( 

i 1 * | 

21. The method of any one of claims 14 to 19, wherein the microorganism 

10 overexpresses acetohydroxyacid isomeroreductase or is transformed with a vector 
comprising an ilvC nucleic acid sequence. 

22. The method of any one of claims 1 4 to 19, wherein the microorganism 

overexpresses dihydroxyacid dehydratase or is transformed with a vector comprising an 

i ' 

1 5 ilvD nucleic acid sequence. , 

i 

23. The method of any one of claims 19 to 22, wherein the microorganism 
overexpresses aspartate-a-decarboxylase or is transformed with a vector comprising a 
panD nucleic acid sequence. 

20 

24. The method of any one of claims 14 to 23, wherein the microorganism 
further has a deregulated pantothenate biosynthetic pathway. 

25. The method of any one of claims 14 to 24, wherein the microorganism 
25 further has at least one mutant gene selected from the group consisting of a mutant avtA 

gene, a mutant ilvE gene, a mutant ansB gene and a mutant alsD gene. 

26. The method of claim 24, wherein the microorganism overexpresses any 
of ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate 

30 synthetase and aspartate-a-decarboxylase. 

27. The method of claim 24 or 26, wherein the microorganism is transformed 
with a vector comprising a panBCD nucleic acid sequence or a vector comprising a 
panEl nucleic acid sequence. 

35 
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28. The. method of any one of claims 14 to 16, and 1 9 to 27, wherein 

• 1 1 • i 

pantothenate is produced at a level selected from the group consisting of a level greater 
than lOg/L, a level greater than 20g/L and a level greater than 40g/L. 

29. The method of claim 20, wherein the microprganism overexpresses 
acetohydroxyacid synthetase derived from Bacillus or is transformed with a vector 
comprising an ilvBN nucleic acid sequence or an alsS nucleic acid sequence derived i 
from Bacillus, 



10 30. The method of claim 21, wherein the microorganism overexpresses 

acetohydroxyacid isomeroreductase derived from Bacillus or is transformed with a 
vector comprising an ilvC nucleic acid sequence derived from Bacillus. 

31. The method of claim 22, wherein the microorganism overexpresses 

t 1 

15 dihydroxyacid dehydratase derived from Bacillus or is transformed with a vector 
comprising av ilvD nucleic acid sequence derived from Bacillus. 

32. The method of claim 23, wherein the microorganism overexpresses 
aspartate-a-decarboxylase derived from Bacillus or is transformed with a vector 

20 comprising a panD nucleic acid sequence derived from Bacillus. 



33. The method of claim 24 or 26, wherein the microorganism overexpresses 
any of ketopantoate hydroxymethyltransferase, ketopantoate reductase, pantothenate 
synthetase and aspartate-a-decarboxylase derived from Bacillus. 

25 

34. The method of claim 27, wherein the vector comprises a panBCD nucleic 
acid sequence or a panEl nucleic acid sequence derived from Bacillus. 

35. A method of producing a panto-compound comprising contacting a 
30 composition comprising at least one pantothenate biosynthesis pathway precursor or 

isoleucine-valine biosynthesis pathway precursor with at least one isolated Bacillus 
enzyme selected from the group consisting of ketopantoate hydroxymethyltransferase, 
ketopantoate reductase, pantothenate synthetase and aspartate-a-decarboxylase, under 
conditions such that the panto-compound is produced. 

35 
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36. A rpethod of producing p-alanine comprising culturing an aspartate-a- 

decarboxylase-overexpressing (AaD-O) microorganism under conditions such that p- 

■ i < i 

alanine is produced. 1 * 

5 37. The method of claim 36, wherein the AaD-O microorganism has a 

mutation in a nucleic acid sequence encoding a pantothenate bio^ynthetic enzyme 
selected from the group consisting of ketopantoate hydroxymethyltransferase, 

ketopantoate reductase and pantothenate synthetase. 

i 1 ■ 

10 38. A method of producing p-alanihe comprising contacting a composition 

comprising aspartate with an isolated Bacillus aspartate-a-decarboxylase enzyme under 
conditions such that p-alanine is produced. 

39. A method for enhancing production of a panto-compound comprising 

15 culturing a mutant microorganism having a mutant coaX gene under conditions such that 
the panto-compound production is enhanced. 

40. The method of claim 39, wherein said recombinant microorganism has a 
mutant coaA gene. 

20 

41 . A method of producing a panto-compound comprising a pantothenate 
kinase mutant microorganism under conditions such that the panto-compound is 
produced at a significantly high yield. 

25 42. The method of claim 4 1 , wherein said mutant microorganism has a 

mutant coaA gene. 

43. The method of claim 41, wherein said mutant microorganism has a 
mutant coaX gene. 

30 

44. The method of claim 41, where said mutant microorganism has a mutant 
coaA and coaX gene. 

45. The method of any one of claims 39 to 44, wherein said panto-compound 
35 is selected from the group consisting of ketopantoate, pantoate or pantothenate. 
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46. The method of any one of claims, 39 to 44, wherein said panto-compound 

■ ' t ■ •,.(■.■ 

is pantothenate. 1 ' 1 . 

' 1 ■ ,' 

i ■ 

47. The method of any one of claims 39 to 44, wherein said panto-compound 
5 is produced at a level selected from the group consisting of a level greater than 1 Og/L, a 

level greater than 20g/L and a level greater than 40g/L. ' 

1 j 1 i ' , ■ ■ 

48. The method of any one of claims 39 to 44, wherein said recombinant 
microorganism further has a deregulated pantothenate biosynthetic pathway or further 

10 has a deregulated isoleucine-valine (ilv) biosynthetic pathway. 

49. The method of claim any one of claims 39 to 44, wherein said 
i ■ , ■ 

recombinant microorganism further overexpresses panD and panE. 

1 5 50. The method of any one of claims 39 to 44, wherein said recombinant 

microorganism further has at least one mutant gene selected from the group consisting 
of a mutant avtA gene, a mutant ilvE gene, a mutant ansB gene and a mutant alsD gene. 

51 . A method for enhancing production of a panto-compound comprising 

* 

20 culturing a microorganism that has a deregulated pantothenate biosynthetic pathway and 
that also has a mutation that results in reduced pantothenate kinase activity under 
conditions such that the panto-compound production is enhanced. 

52. A method for identifying compounds which modulate pantothenate 

25 kinase activity comprising contacting a recombinant cell expressing pantothenate kinase 
encoded by the coaX gene with a test compound and determining the ability of the test 
compound to modulate pantothenate kinase activity in said cell. 

53. The method of claim 52, wherein said cell further comprises a mutant 
30 coaA gene encoding a pantothenate kinase having reduced activity. 

54. The method of any one of claims 1 to 51, wherein the microorganism is 
Gram positive. 

35 55. The method of any one of claims 1 to 51 , wherein the microorganism is 

Gram negative. 
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56. The method of any one of claims 1 to 51, wherein the microorganism is a 

microorganism belonging to a genus selected from the group consisting of Bacillus, 

i , i 

Corny ebacterium y Lactobacillus, Lactococci and Streptomyces. 1 

5 57. The method of any one of claims 1 to 51 and 54 to 56, wherein the 

microorganism is of the genus Bacillus. 

58. The method of any one of claims 1 to 51 and 54 to 57, wherein the 
microorganism is Bacillus subtilis. 

io i 

59. The method of any one of claims 1 to 13, 35, 39 to 51 and 54 to 58, 
further comprising recovering the panto-compound. 

i - + 

60. The method of anyj one of claims 14 to 34 and 54 to 58, further 
1 5 comprising recovering the pantothenate. . 

61 . The method of any one of claims 1 to 14, 35 , 39 to 46, 48 to 5 1 and 54 to 
59, wherein the panto-compound is produced at a level greater than 2 g/L. 

20 62. A recombinant microorganism which overexpresses at least one Bacillus 

pantothenate biosynthetic enzyme. 

63. The recombinant microorganism of claim 62, which overexpresses at 
least one Bacillus subtilis pantothenate biosynthetic enzyme. 

64. The recombinant microorganism of claim 62 or 63, wherein the 
pantothenate biosynthetic enzyme is selected from the group consisting of ketopantoate 
hydroxymethyltransferase, pantothenate synthetase, aspartate-a-decarboxylase and 
ketopantoate reductase. 

65. The recombinant microorganism of any one of claims 62 to 64, wherein 
the pantothenate biosynthetic enzyme is ketopantoate reductase. 

66. A recombinant microorganism which overexpresses aspartate-a- 
decarboxylase and has a deregulated isoleucine-valine (z7v) biosynthetic pathway. 



25 



30 



35 
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, 67. A recombinant microorganism haying a mutant coaA'' gene*, said mutant , 

coaX gene encoding reduced pantothenate kinase activity in said microorganism. 

68. The recombinant microorganism of claim 67 further having a mutant 
5 coaA gene, said mutant coaA gene encoding reduced pantothenate kinase activity in said 
microorganism. 

i 

, 69. A recombinant microorganism having a mutant coaX gene and optionally 
having a mutant coaA gene, said mutant microorganism having reduced pantothenate 
10 kinase activity as compared to a microorganism having wild-type coaA and coaX genes. 
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70. A recombinant microorganism comprising a vector comprising an 
isolated coaX gene. 

! 1 

15 71. A recombinant microorganism that overproduces a panto-compound, the 

microorganism having a deregulated pantothenate biosynthetic pathway and having at 
least one mutation that results in a decrease in the capacity of the microorganism to 
synthesize Coenzyme A (CoA). 

20 72. The recombinant microorganism of claim 71 , having at least one 

mutation that results in a reduced level of pantothenate kinase activity. 

73. The recombinant microorganism of claim 72, having a mutation in a 
coaA gene, or homologue thereof, that results in a reduced level of CoaA enzyme 

25 activity. 

74. The recombinant microorganism of claim 72, having a mutation in a 
coaX gene, or homologue thereof, that results in a reduced level of CoaX enzyme 
activity. 



75. The recombinant microorganism of claim 72, having a mutation in a 
coaA gene, or homologue thereof, and having a mutation in a coaX gene, or homologue 
thereof, the mutations resulting in reduced levels of CoaA enzyme activity and reduced 
CoaX enzyme activity. 

76. The recombinant microorganism of any one of claims 66 to 70 which 
further has a deregulated pantothenate biosynthetic pathway. 
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ii' i 

i 

77. The recombinant microorganism of $ny one of claims' 62 to 65 and 67 to 

75, further having a deregulated isoleucine-valine (/7v) biosynthetic pathway. 

i * 

i 

i 

5 78. The recombinant microorganism of any one of claims 62 to 77, which is 

Gram positive. t , ' ' ' S 

1 X 1 * 1 • 

79. The recombinant microorganism of claim 78 belonging to a genus 

selected from the group consisting of Bacillus, Cornyebacterium> Lactobacillus, 
10 Lactococci axidStreptomyces. 

i i 

80. The recombinant microorganism of claim 79 belonging to the genus 
Bacillus. 1 

15 81 . The recombinant microorganism of claim 80 which is Bacillus subtilis. 

i 

82. A recombinant microorganism selected from the group consisting of 
PA221, PA235, PA236, PA313, PA410, PA402, PA403, PA411, PA412, PA413, 
PA303, PA327, PA328, PA401, PA340, PA342, PA404, PA405, PA374, PA354, 

20 PA365, PA377, PA65 1 and PA824. 

83. A recombinant vector for use in the production of panto-compounds 
comprising a nucleic acid sequence which encodes at least one Bacillus pantothenate 
biosynthetic enzyme operably linked to regulatory sequences. 

25 

84. The vector of claim 83, comprising a nucleic acid sequence which 
encodes at least one Bacillus subtilis pantothenate biosynthetic enzyme. 

85. The vector of claim 84, wherein the nucleic acid sequence encodes at 
30 least one of ketopantoate hydroxymethyltransferase, pantothenate synthetase, aspartate- 

a-decarboxylase and ketopantoate reductase. 
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86. A recombinant vector comprising at least one nucleic acid sequence 
selected from the group consisting of SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, 
SEQ ID NO:29 and SEQ ID NO:59. 
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87. The vector of claim 84, wherein the nucleic acid sequence encodes 

ketopahtoate reductase. ( ( ' ' ' • ' 

1 t it 

88. A vector comprising a mutant coaX gene, said mutant encoding a 
5 pantothenate kinase enzyme having reduced activity. ( 

1 j 1 1 ' 

89. A vector comprising an isolated coaX gene. ' 

90. A vector comprising an isolated Bacillus coaX gene. ( 

i 

91 . A vector comprising an isolated Bacillus subtilis coaX gene. 

* t 

92. The vector of any one of claims 86 and 89 to 91, which further comprises 
regulatory sequences. ! , 

93. The vector of any one of claims 83 to 85, 87 and 92, wherein the 
regulatory sequences comprise a constitutively active promoter. 

94. The vector of claim 93, wherein the constitutively active promoter 
20 comprises P veg (SEQ ID NO:41), P iS (SEQ ID NO:39) or P 26 (SEQ ID NO:40) 

sequences. 

95. The vector of claim 83, wherein the regulatory sequences comprise at 
least one artificial ribosome binding site (RBS). 

25 

96. The vector of claim 95, wherein the artificial RBS comprises a nucleotide 
sequence selected from the group consisting of SEQ ID NO:49, SEQ ID NO:50, SEQ ID 
NO:5 1 , SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID 
NO:56 and SEQ ID NO:57. 

30 

97. A vector selected from the group consisting of pAN004, pAN005, 
pAN006, pAN236, pAN423, pAN428, pAN429, pAN441, pAN442, pAN443, pAN251, 
pAN267, pAN256, pAN257, pAN263, pAN240, pAN294, pAN296, pAN336, pAN341 
and pAN342. 

35 

98. A recombinant microorganism comprising the vector of claim 86 or 93. 
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99. Ani isolated nucleic acid molecule which encodes at least one Bacillus 
pantothenate biosynthetic gene. 



\ > 



1 00. The isolated nucleic acid molecule of claim 99 which encodes at least 
5 one Bacillus subtilis pantothenate biosynthetic gene. ■ 

' 1 • 1 

* ^ I 

101. The isolated nucleic acid molecule of claim 99 or 100 which encodes 
ketopantoate reductase. 

i 

10 1 02. An isolated Bacillus pantothenate biosynthetic enzyme polypeptide. 

1 03. An isolated Bacillus subtilis pantothenate biosynthetic enzyme 
polypeptide. 

15 1 04. An isolated Bacillus ketopantoate reductase polypeptide. 

1 05. An isolated Bacillus subtilis ketopantoate reductase polypeptide. 



20 



1 06. An isolated Bacillus aspartate-a-decarboxy lase polypeptide. 

107. An isolated Bacillus subtilis aspartate-a-decarboxylase polypeptide. 

108. An isolated nucleic acid molecule comprising a mutant coaX gene. 
25 109. An isolated nucleic acid molecule comprising a cooYgene. 

110. An isolated pantothenate kinase protein encoded by a coaX gent. 
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Figure 2. Plasmid pAN240 t containing sequences ligated upstream of 
the P 26 panBCD cassette, equivalent to the integrated version in strain 1 
PA221. 
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Figure Plasmid pAN004, containing the pnnBCDpperon expressed from PIS dndRBSl. 
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Figurt5QPlasmid pAN006< containing the panBCD operon expressed from P26 and RBS2. 
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Figure*? Construction ofpAS'423 
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Figure (# Construction ofpAN426 and pAN427. 
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Figure 7 Construction ofpA\42S ami pAS'42^. 
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Figure 3. Construction ofpAX-fJI. 
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Figure 9. Construction of pA.\44l. 
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Figure 10. Consintctiqn ofpASJJih 
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Figure | J 1 Structure ofpAN251, a plasmid designed to integrate d single copy o/^g-panEl at the 
pan El locus by double crossover* 1 , 
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Figure 121 Structure ofpAN267, a plasmid designed to stably integrate a Pis ilvBJNC 

cassette at the amyE locus. 
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Figure 13 Structure ofpAN257, a clone o/B. subtilis ilyD, in a low copy vector* 
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Figure l*{ Structure of pAN263, designed to stably integrate a single copy of 

P26 ilvD at the ilvD locus. 
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Figure >5 Structure of pAN261 desiois . Jr- , , 
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Figure Structure of P AN296, designed to delete most of the B. su btilis c a A gene and substitute a 
chloramphenicol resistance gene. ( 
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Figure Structure ofpAN281 f a plasmidfor expressing B. subtilis coaA after integration 
at the bpr locus, pAN282 and pAN283 have similar structures.' 
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FigurelZ Structure ofpAN341andpAN342, two independent PCR~ derived clones of 
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Figure 2-S Structure ofpAN294, a plasmidfor integrating mutagenized B. subtilis coaA 



at its native locus* 
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Figure2(o Structure ofpAN336 9 a ptasmid designed to delete B. subtil is coaX from the 
chromosome and replace U wish a kanamycin resistance gene. 
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■ 1 

t 

, , SEQUENCE LISTING 

<110> OMNIGENE BIOPRODUCTS ' ' ' 1 . . 

■ ■ - • 

<120> METHODS AND MICROORGANISMS FOR PRODUCTION OF 
PANTO-COMPOUNDS - 

<130> BGI-141CPP<p ! , ' ' 

, ■ ' '• ■ 

<140> 

<150> 'USSN 09/(400, 494 ( 

<151> 1999-09-21 , ' 

i 

<150> USSN 60/210,072 ' , 

<151> 2000-06-07 

i | 

<150> USSN 60/221,836 
<151>" 2'000-07-28 

<150> USSN 60/221,836 ' 
<151> 2000-08 7 24 

<160> 94 

<170> Patentln Ver. 2.0. 

<210> 1 
<211> 311 
<212> PRT 

<213> Haemophilus influenzae 



<400> 1 

Met Glu Phe Ser Thr Gin Gin Thr Pro Phe Leu Ser Phe Asn Arg Glu 
15 10 15 

Gin Trp Ala Glu Leu Arg Lys Ser Val Pro Leu Lys Leu Thr Glu Gin 

20 25 30 



Asp Leu Lys Pro Leu Leu Gly Phe Asn Glu Asp Leu Ser Leu Asp Glu 
35 40 45 

Val Ser Thr lie Tyr Leu Pro Leu Thr Arg Leu lie Asn Tyr Tyr lie 
50 55 60 

Asp Glu Asn Leu His Arg Gin Thr Val Leu His Arg Phe Leu Gly Arg 
65 70 75 80 



Asn Asn Ala Lys Thr Pro Tyr lie lie Ser He Ala Gly Ser Val Ala 

85 90 95 

Val Gly Lys Ser Thr Ser Ala Arg He Leu Gin Ser Leu Leu Ser His 

100 105 110 

Trp Pro Thr Glu Arg Lys Val Asp Leu He Thr Thr Asp Gly Phe Leu 

115 120 125 



t 



WO 01/21772 PCT/US00/25993 

i 1 * 

-2- ■ . ' 

I i » t 

l 

i 

Tyr Pro Leu Asp Lys Leu Lys Gin, Asp Asn Leu .JLeii Gin Lys Lys Gly, 

130 11 ,135 ■ * 140 , ' ' 

1 ■ 

Phe Pro Val Ser Tyr Asp Thr Pro' Lys Leu He Arg Phe Leu Ala Asp 
145 1 150 155 160 

Val Lys Ser Gly Lys Ser Asn Val. Thr Ala ( Pro ( lie Tyr Ser His Leu 

■ 165 ■ ' 170 , 175 

Thr Tyr Asp lie lie Pro Asp Lys Phe Asp Val Val Asp Lys Pro Asp 
, , , 180 ' ! 185 , 190 

lie Leu lie Leu Glu Gly Leu Asn Val Leu 6ln Thr Gly Asn Asn Lys 

195 200 • . 205 

■ t 

i 

Thr Asp Gin Thr Phe, Val Ser Asp Phe- Val Asp Phe Ser lie Tyr Val 

210 ' ' 215 220- 

, -i 

Asp Ala, Glu ,Glu Lys Leu Leu Lys Glu Trp Tyr lie ,,Lys Arg Phe Leu 
225 230 235 240 

Lys Phe Arg Glu Ser Ala Phe ! Asn Asp Pro Asn Ser Tyr Phe ' Lys His 

. 245 250 255 

Tyr Ala Ser Leu Ser Lys Glu Glu Ala lie Ala Thr Ala Ser Lys lie 

260 , 265 270 

Trp Asp Glu lie Asn Gly Leu Asn Leu Asn Gin Asn lie Leu Pro Thr 
275 280 285 

Arg Glu Arg Ala Asn Leu lie Leu Lys Lys Gly His Asn His Gin Val 
290 295 300 

Glu Leu lie Lys Leu Arg Lys 
305 310 



<210> 2 
<211> 316 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Met Ser lie Lys Glu Gin Thr Leu Met Thr Pro Tyr Leu Gin Phe Asp 
15 10 15 

Arg Asn Gin Trp Ala Ala Leu Arg Asp Ser Val Pro Met Thr Leu Ser 

20 25 30 

Glu Asp Glu lie Ala Arg Leu Lys Gly lie Asn Glu Asp Leu Ser Leu 
35 40 45 

Glu Glu Val Ala Glu lie Tyr Leu Pro Leu Ser Arg Leu Leu Asn Phe 
50 55 60 

Tyr lie Ser Ser Asn Leu Arg Arg Gin Ala Val Leu Glu Gin Phe Leu 
65 70 75 80 
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Gly Thr Asn Gly Gin Arg lie Pro Tyr lie lie Ser lie Ala Gly Ser 



85 



90 



95 ' 



Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu Gin Ala Leu Leu 

100 105 ' 110 

Ser Arg Trp Pro Glu His Arg Arg Val Glu Leu lie Thr Thr Asp Gly 

115 ,120 >125 ' 

i 

Phe Leu His Pro Asn Gin Val Leu Lys Glu Arg Gly Leu Met Lys Ly;s 



130 



135 



140 



Lys' Gly Phe Pro Glu Ser Tyr I Asp Met His Arg Leu Val Lys Phe Val 
145 150 155 160 

Ser Asp Leu Lys Ser Gly Val Pro Asn Val Thr Ala Pro Val Tyr Ser 

165 170 175 

His Leu lie Tyr Asp Val lie Pro Asp Gly Asp Lys Thr Val Val Gin 

180 ' 185 190 

Pro Asp lie Leu lie Leu Glu^Gly Leu Asn Val Leu Gin Ser Gly Met 
195 200 205 

i 

Asp Tyr Pro His Asp Pro His His Val Phe Val Ser Asp Phe Val Asp 
210 215 220 

Phe Ser lie Tyr Val Asp Ala Pro Glu Asp Leu Leu Gin Thr Trp Tyr 
225 230 235 240 

lie Asn Arg Phe Leu Lys Phe Arg Glu Gly Ala Phe Thr Asp Pro Asp 

245 250 255 

Ser Tyr Phe His Asn Tyr Ala Lys Leu Thr Lys Glu Glu Ala lie Lys 

260 265 270 

Thr Ala Met Thr Leu Trp Lys Glu lie Asn Trp Leu Asn Leu Lys Gin 
275 280 285 

Asn lie Leu Pro Thr Arg Glu Arg Ala Ser Leu lie Leu Thr Lys Ser 
290 295 300 

Ala Asn His Ala Val Glu Glu Val Arg Leu Arg Lys 
305 310 315 



<210> 3 
<211> 319 
<212> PRT 

<213> Bacillus subtilis 
<400> 3 

Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gin His Asn 
15 10 15 



Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser lie Ala Val Ser 

20 25 30 
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Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp, Tyr Leu Ser, Val 

35 • 40' 1 ' ,45. , ' 

' i < 

Glu Glu Val Glu Thr lie, Tyr lie. Pro Leu Val Arg Leu Leu His Leu 

50 55 .60 

His Val Lys Ser Ala Ala Glu Arg Asn Lys His Val Asn Val Phe Leu 

65 70 75 ' ' 80. 



( Lys His Pro His Ser Ala Lys 

' < 85 



lie Pro Phe lie He Gly He Ala Gl<y t 
90 .' ' 95 



Ser Val Ala Val Gly Lys Ser ! Thr Thr Ala Arg He Leu Gin Lys Leu 

100 : 105 110 



Leu Ser Arg Leu Pro Asp Arg ' Pro Lys Val Sei Leu He Thr Thr Asp 
115 120 125, 



Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys Lys Lys Asn Met Met Ser 



130 



135 



140 



Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe 
145 150 155 160 

i 

Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr 

165 170 175 

Ser His Leu Thr Tyr Asp Arg Glu Glu Gly Val Phe Glu Val Val Glu 

180 185 190 

Gin Ala Asp He Val He He Glu Gly He Asn Val Leu Gin Ser Pro 
195 200 205 

Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg He Phe Val Ser Asp Phe 
210 215 220 

Phe Asp Phe Ser He Tyr Val Asp Ala Glu Glu Ser Arg He Phe Thr 
225 230 235 240 

Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn 

245 250 255 

Pro Asp Ser Tyr Phe His Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala 

260 265 270 

Asp Glu Met Ala Ala Ser He Trp Glu Ser Val Asn Arg Pro Asn Leu 
275 280 285 

Tyr Glu Asn He Leu Pro Thr Lys Phe Arg Ser Asp Leu He Leu Arg 
290 295 300 



Lys Gly Asp Gly His Lys Val Glu Glu Val Leu Val Arg Arg Val 
305 310 315 



<210> 4 
<211> 312 
<212> PRT 
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<213> Mycobacterium leprae 



<400> 4 1 ' i , 

Met Pro Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp 'Arg Lys 
1 ,5 10' 15 

Gin Trp Arg Ala Leu Arg Met Ser Thr Pro Leu ^la Leu Thr Glu Glu 

20 1 25 ' 1 30 , 



i 



Glu Leu lie Gly Leu Arg Gly Leu Gly Glu Glri lie Asp Leu Leu Glu 

35 '40 i 45 

i i ' 1 

Val G1U Glu VajL Tyr Leu Pro Leu Ala Arg Leu lie His Leu Gin Val 
50 55 60 

Ala Ala Arg Gin Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu 
65 ' 70 ' 75 80 

Pro Gin Gin Asn Pro Gly Arg Pro Val Pro Phe lie lie Gly Val .Ala 
' ' . 85 90 1 95 

Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala Arg Val Leu, Gin Ala 

100 105 110 

Leu Leu Ala Arg Trp Asp His His Thr Arg Val Asp Leu Val Thr Thr 
115 120 , 125 

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gly Arg Arg Asn Leu Met 
130 135 140 

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg 
145 150 155 160 

Phe Val Thr Ser Val Lys Ser Gly Ala Asp Tyr Ala, Cys Ala Pro Val 

165 170 175 

Tyr Ser His Leu Arg Tyr Asp Thr lie Pro Gly Ala Lys His Val Val 

180 185 190 

Arg His Pro Asp lie Leu lie Leu Glu Gly Leu Asn Val Leu Gin Thr 
195 200 205 

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val 
210 215 220 

Asp Ala Arg lie Gin Asp lie Glu Gin Trp Tyr Val Ser Arg Phe Leu 
225 230 235 240 

Ala Met Arg Gly Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His 

245 250 255 

Tyr Ser Ala Leu Thr Asp Ser Lys Ala lie lie Ala Ala Arg Glu lie 

260 265 270 

Trp Arg Ser lie Asn Arg Pro Asn Leu Val Glu Asn lie Leu Pro Thr 
275 280 285 



Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser lie 



i M 
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i ( 

290 , | 295 , ,3.00 

i ' i ' , 1 ■ 

Asn Arg Leu Arg Leu Arg Lys Leu ' , 

305 ' 310 ' 1 



<210> 5 

<211> 312 i ' 

<212> PRT 1 . i 

<213> Mycobacterium tuberculosis 1 ' 

<400> 5 ' 

Met Ser Arg Leu Ser Glu Pro Ser Pro Tyr Val Glu Phe Asp Arg Arg 

1 '5 10 - 15 

i 

i 

Gin Trp Arg Ala Leu Arg Met Ser Thr Pro .Leu Ala Leu Thr Glu Glu 

20 ,25 30 

i 

Glu LeUj Val Gly Leu Arg Gly Leu Gly Glu Gin lie, Asp Leu Leu Glu 
35 40 45 

Val Glu Glu Val Tyr Leu Pro Leu Ala Arg Leu lie .His Leu 1 Gin Val 
50 i 55 60 

Ala Ala Arg Gin Arg Leu Phe Ala Ala Thr Ala Glu Phe Leu Gly Glu 
65 70 ( 75 80 

Pro Gin Gin Asn Pro Asp Arg Pro Val Pro Phe Tie lie Gly Val Ala 

85 90 95 

Gly Ser Val Ala Val Gly- Lys Ser Thr Thr Ala Arg Val Leu Gin Ala 

100 105 110 

Leu Leu Ala Arg Trp Asp His His Pro Arg Val Asp Leu Val Thr Thr 
115 120 125 

Asp Gly Phe Leu Tyr Pro Asn Ala Glu Leu Gin Arg Arg Asn Leu Met 
130 135 140 

His Arg Lys Gly Phe Pro Glu Ser Tyr Asn Arg Arg Ala Leu Met Arg 
145 150 155 160 

Phe Val Thr Ser Val Lys Ser Gly Ser Asp Tyr Ala Cys Ala Pro Val 

165 170 175 

Tyr Ser His Leu His Tyr Asp lie lie Pro Gly Ala Glu Gin Val Val 

180 185 190 

Arg His Pro Asp lie Leu lie Leu Glu Gly Leu Asn Val Leu Gin Thr 
195 200 205 

Gly Pro Thr Leu Met Val Ser Asp Leu Phe Asp Phe Ser Leu Tyr Val 
210 215 220 

Asp Ala Arg lie Glu Asp lie Glu Gin Trp Tyr Val Ser Arg Phe Leu 
225 230 235 240 

Ala Met Arg Thr Thr Ala Phe Ala Asp Pro Glu Ser His Phe His His 
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245 



250 



255 



Tyr Ala Ala Phe Ser Asp Ser Gin Ala Val Val Ala' Ala Arg Glu ,11'e 

260 265 ' 270 - , • 

Trp Arg Thr lie Asn Arg Pro Asn Leu Val Glu Asn lie Leu Pro Thr 

275 ! 280 285 



Arg Pro Arg Ala Thr Leu Val Leu Arg Lys Asp Ala Asp His Ser lie 



290 



295 



Asn Arg Leu Arg Leu Arg Lys 
, 305 310 



300 



Leu 



r 



<210> 6 
<211> 329 
<212> PRT 

<213> Streptomyces coelicolbr 



<400> 6 

Met lie Ser Pro Val Pro Serl lie Pro Arg Ser Ala His Arg Gin Arg 



10 



15 



Pro Glu Ala Thr Pro Tyr Val, Asp Leu Thr Arg Pro Glu Trp Ser Ala 

20 25 30 

i 

Leu Arg Asp Lys Thr Pro Leu Pro Leu Thr Ala Glu Glu Val Glu Lys 
35 40 45 

Leu Arg Gly Leu Gly Asp Val lie Asp Leu Asp Glu Val Arg Asp lie 
50 55 60 

Tyr Leu Pro Leu Ser Arg Leu Leu Asn Leu Tyr Val Gly Ala Thr Asp 
65 70 75 80 

Gly Leu Arg Gly Ala Leu Asn Thr Phe Leu Gly Glu Gin Gly Ser Gin 

85 90 95 

Ser Gly Thr Pro Phe Val lie Gly Val Ala Gly Ser Val Ala Val Gly 

100 105 110 

Lys Ser Thr Val Ala Arg Leu Leu Gin Ala Leu Leu Ser Arg Trp Pro 
115 120 125 

Glu His Pro Arg Val Glu Leu Val Thr Thr Asp Gly Phe Leu Leu Pro 
130 135 140 

Thr Arg Glu Leu Glu Ala Arg Gly Leu Met Ser Arg Lys Gly Phe Pro 
145 150 155 160 

Glu Ser Tyr Asp Arg Arg Ala Leu Thr Arg Phe Val Ala Asp lie Lys 

165 170 175 



Ala Gly Lys Ala Glu Val Thr Ala Pro Val Tyr Ser His Leu He Tyr 

180 185 190 



Asp He Val Pro Asp Gin Arg Leu Val Val Arg Arg Pro Asp He Leu 
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195 



200 



205 



* i 1 



lie Val Glu Gly Leu Asn Val Leu Gin Pro Ala Leu/ Pro Gly Lys Asp 

210 215 • ' 220 1 ■ ' 

Gly Arg Thr Arg Val Gly Leu, Ala Asp Tyr Phe Asp Phe Ser Val Tyr 
225 230 235 240 

Val Asp Ala Arg Thr Glu Asp . lie Glu Arg Trp Tyr Leu Asn Arg Phe 



245 



Arg Lys Leu Arg Ala Thr Ala 
' 260 



250 



255 



Phe Gin Asn Pro Ser Ser Tyr Phe Arg 
265 270 



Lys Tyr Thr Gin Val Ser Glu |Glu Glu Ala Leu Asp Tyr Ala Arg 
275 280 285 

Thr Trp Arg Thr lie Asn Lys Pro Asn Leu Val Glu Asn Val Ala Pro 
290 295. ! ' 300 



Thr Arg Gly Arg A3. a Thr Leu Val Leu Arg Lys Gly Pro Asp His Lys 



305 



310 



315 



320 



Val Gin Arg Leu Ser Leu Arg Lys Leu 

325 



<210> 7 
<211> 265 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 7 

Met Leu Leu Thr lie Asp. Val Gly Asn Thr His Thr Val Leu Gly Leu 
15 10 15 

Phe Asp Gly Glu Asp lie Val Glu His Trp Arg lie Ser Thr Asp Ser 

20 25 30 

Arg Arg Thr Ala Asp Glu Leu Ala Val Leu Leu Gin Gly Leu^ Met Gly 
35 40 45 

Met His Pro Leu Leu Gly Asp Glu Leu Gly Asp Gly lie Asp Gly lie 
50 55 60 

Ala lie Cys Ala Thr Val Pro Ser Val Leu His Glu Leu Arg Glu Val 
65 70 75 80 

Thr Arg Arg Tyr Tyr Gly Asp Val Pro Ala Val Leu Val Glu- Pro Gly 

85 90 95 

Val Lys Thr Gly Val Pro lie Leu Thr Asp His Pro Lys Glu Val Gly 

100 105 110 

Ala Asp Arg He He Asn Ala Val Ala Ala Val Glu Leu Tyr Gly Gly 
115 120 125 



Pro Ala He Val Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Ala Val 
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130 135 140 



Ser Ala Arg Gly 1 Glu Tyr lie Gly 1 Gly Val 11$. Ala Pro Gly lie Glu 

145 i • 150 , 155 , ; ' . ' , ■ 160 

i 

lie Ser Val Glu Ala Leu Gly Val Lys Gly Ala Gin Leu Arg Lys lie 

165 170 , , 175 

i '1,1., 

Glu Val Ala Arg Pro Arg Ser Val lie Gly : Lys Asn Thr Val Glu Ala 

180 ' 185 ; / ■ 190 

', 1 i • . ' 

Met Gin ' Ser Gly He Val Tyr Gly Phe fila Gly Gin Val Asp. .Gly Val 
• 195 , 200 t V 205 

t . i • > 

Val Asn Arg Met Ala Arg Glu Leu Ala Asp Asp Pro Asp Asp Val Thr 
210 215 220 



Val He Ala Thr Gly Gly Leu Ala Pro Me,t Val Leu Gly Glu Ser Ser 
225 230 235 240 

Val He Asp Glu His Glu Pro Trp Leu Thr Leu Met Gly Leu Arg Leu 

245 250 ,255 

Val Tyr Glu Arg Asn Val Ser Arg Met 

260 2 65 



<210> 8 
<211> 272 
<212> PRT 

<213> Mycobacterium tuberculosis 
<400> 8 

Met Leu Leu Ala He Asp Val Arg Asn Thr His Thr Val Val Gly Leu 
1 5 10 15 

Leu Ser Gly Met Lys Glu His Ala Lys Val Val Gin Gin Trp Arg He 

20 25 30 

Arg Thr Glu Ser Glu Val Thr Ala Asp Glu Leu Ala Leu Thr He Asp 
35 40 45 

Gly Leu He Gly Glu Asp Ser Glu Arg Leu Thr Gly Thr Ala Ala Leu 
50 55 60 

Ser Thr Val Pro Ser Val Leu His Glu Val Arg He Met Leu Asp Gin 
65 70 75 80 

Tyr Trp Pro Ser Val Pro His Val Leu He Glu Pro Gly Val Arg Thr 

85 90 95 

Gly He Pro Leu Leu Val Asp Asn Pro Lys Glu Val Gly Ala Asp Arg 

100 105 110 

lie Val Asn Cys Leu Ala Ala Tyr Asp Arg Phe Arg Lys Ala Ala He 
115 120 125 

Val Val Asp Phe Gly Ser Ser He Cys Val Asp Val Val Ser Ala Lys 
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130 



• i 



135 



- 10- 



140 



Gly Glu Phe Leu Gly Gly Ala lie Ala Pro Giy Val Gin Val ( Ser Ser 

145 " 150 ■ ,155 • • ' . ' ■ 160 



Asp Ala Ala Ala Ala Arg Ser Ala Ala Leu Arg Arg Val Glu Leu Ala 



165 



170 



175 



Arg Pro Arg Ser Val Val Gly Lys Asn ,Thr Val Glu Cys Met Gin Ala 

180 185 . 190 

' t » . , 

Gly Ala Val Phe Gly Phe Ala Gly Leu Val Asp Gly Leu Val* Gly Arg 

195 i 200 1 , 205 



lie Arg Glu Asp Val Ser Gly Phe Ser Val, Asp His Asp Val Ala lie 
210 215 ■ , , 220 



Val Ala Thr Gly His Thr Ala Pro Leu Leu Leu Pro Glu Leu His Thr 

225 j 230 235 ( 240 

Val Asp His Tyr Asp Gin His Leu Thr Leu Gin Gly Leu Arg Leu Val 

245 1 250 1 255 

Phe Glu Arg Asn Leu Glu Val Gin Arg Gly Arg Leu Lys Thr Ala Arg 

260 265 270 



<210> 9 
<211> 258 
<212> PRT 

<213> Bacillus subtilis 
<400> 9 

Leu Leu Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg lie Glu Thr Ser Arg 

20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin Tie Asp Gly lie He He Ser Ser 
50 55 60 

Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 

85 90 95 

Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 

100 105 110 



Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 
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Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr lie Asp Glu, Asn Lys Gin 
130 135 140' ' ' ■ 



Tyr Met Gly Gly Ala lie Ala 'Pro Gly lie Thr lie Ser Thr Glu Ala 
145 150 155 - 160 



Leu Tyr Ser Arg Ala Ala Lys , Leu Pro Arg lie Glu« lie Thr Arg Pro 

165 170 ' 175 



Asp Asn lie lie Gly Lys Asn 

180 



Thr Val Ser Ala Met Gin Ser Gly lie 
185 190 



Leu Phe Gly Tyr Val Gly Gln ; Val Glu Gly lie Val Lys Arg Met Lys 



195 



200 



205 



t 1 



Trp Gin Ala Lys Gin Asp Leu Lys Val lie Ala Thr Gly Gly Leu Ala 
210 215 220 

Pro Leu lie Ala Asn Glu Ser Asp Cys lie Asp lie Val Asp Pro Phe 
225 230 235 240 

i 

i 

; i 

Leu Thr Leu Lys Gly Leu Glu Leu lie Tyr Glu Arg Asn Arg Val Gly 

245 250 255 



Ser Val 



<210> 10 
<211> 262 
<212> PRT 

<213> Deinococcus radiopugnans 
<400> 10 

Met Pro Ala Phe Pro Leu Leu Ala Val Asp lie Gly Asn Thr Thr Thr 
15 10 15 

Val Leu Gly Leu Ala Asp Ala Ser Gly Ala Leu Thr His Thr Trp Arg 

20 25 30 

lie Arg Thr Asn Arg Glu Met Leu Pro Asp Asp Leu Ala Leu Gin Leu 
35 40 45 

His Gly Leu Phe Thr Leu Ala Gly Ala Pro lie Pro Arg Ala Ala Val 
50 55 60 

Leu Ser Ser Val Ala Pro Pro Val Gly Glu Asn Tyr Ala Leu Ala Leu 
65 70 75 80 

Lys Arg His Phe Met lie Asp Ala Phe Ala Val Ser Ala Glu Asn Leu 

85 90 95 

Pro Asp Val Thr Val Glu Leu Asp Thr Pro Gly Ser Val Gly Ala Asp 

100 105 110 



Arg Leu Cys Asn Leu Phe Gly Ala Glu Lys Tyr Leu Gly Gly Leu Asp 
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115 

i 1 

i 

Tyr Ala Val Val Val 
130 



120 

Asp Phe Gly 
135 



125 



Thr Ser ,Thr Asq Phe Asp Val 'Val 

140 



Gly Arg Gly Arg Arg 
145 



Phe Leu, Gly 
150 



Gly lie Leu Ala Thr Gly Ala Gin 

• .155 ■ 160 



Val Ser Ala Asp Ala 

165 



Leu Phe: Ala Arg Ala' Ala Lys Leu. Pro Arg lie 

170 ' 1 175 ' - , 



Thr Leu Gin Ala Pro Glu Thr 

180 



Ala 



He Gly Lys Asn Thr Val His Ala 
185 190 



Leu Gin Ser Gly Leu 
195 



Val Phe] Gly 
i 200 



I 



I.* • 



Tyr Ala Glu Met Val Asp Gly Leu 

205 



Leu Arg Arg He Arg 
210 



Ala Glu Leu 

215 ! 



Pro Gly Glu Ala Val Ala Val Ala 

220 



Thr Gly Gly Phe Ser 

225 

Tyr Tyr Asp Glu Thr 

245 



Arg Thr Val Gin Gly lie Cys Gin Glu He Asp 

230 , " 235 240 

i 

i < 
Leu Thr Leu Arg Gly Leu Val Glu Leu Trp Ala 

250 255 



Ser Arg Ser Glu Val Arg 

260 



<210> 11 
<211> 212 
<212> PRT 

<213> Desulf ovibrio vulgaris 
<400> 11 

Met Thr Gin His Phe Leu Leu Phe Asp lie Gly Asn Thr Asn Val Lys 
15 10 15 

He Gly He Ala Val Glu Thr Ala Val Leu Thr Ser Tyr Val Leu Pro 

20 25 30 

Thr Asp Pro Gly Gin Thr Thr Asp Ser He Gly Leu Arg Leu Leu Glu 
35 40 45 

Val Leu Arg His Ala Gly Leu Gly Pro Ala Asp Val Gly Ala Cys Val 
50 55 60 

Ala Ser Ser Val Val Pro Gly Val Asn Pro Leu He Arg Arg Ala Cys 
65 70 75 80 

Glu Arg Tyr Leu Tyr Arg Lys Leu Leu Phe Ala Pro Gly Asp He Ala 

85 90 95 

He Pro Leu Asp Asn Arg Tyr Glu Arg Pro Ala Glu Val Gly Ala Asp 

100 105 110 

Arg Leu Val Ala Ala Tyr Ala Ala Arg Arg Leu Tyr Pro Gly Pro Arg 
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115 120, . 125' 

» t i ' • . • 1 i .) ,. ; . 

Ser Leu Val Ser' Val Asp Phe Gly Thr Ala Thr,. Thr Phe Asp Cys Val 

130 1 1 • ■ 135 , 140 ■ 

i ' 

Glu Gly Gly Ala Tyr Leu Gly Gly Leu lie Cys Pro Gly Val Leu Ser 
145 150 155 , ' 160 

ii ) ■ - ' * 1 

eu Ser Ser Arg Thr Ala Lys Leu Pro Arg lie Ser 

165 170 ■ ! , '■ 175 ' 

L^u Glu'Val Glu Glu Asp Ser Pro Val Jle Gly Arg Ser Thr (1 Thr Thr 

180 185 ' t 190 

Ser Leu Asn His Gly Phe lie Phe Gly Phe Ala Ala Met Thr Glu Gly 1 
195 , 200 ' 205 



Val. Leu Ala Ala . 
210 



<210> 12, 
<211> 246 
<212> PRT 

<213> Thermotoga maritima 

i 

<400> 12 

Met Tyr Leu Leu Val Asp Val Gly Asn Thr His Ser Val Phe Ser lie 
1 .5 10 15 

Thr Glu Asp Gly Lys Thr Phe Arg Arg Trp Arg Leu Ser Thr Gly Val 

20 25 30 

Phe Gin Thr Glu Asp Glu Leu Phe Ser His Leu His Pro Leu Leu Gly 
35 40 45 

Asp Ala Met Arg Glu lie Lys Gly lie Gly Val Ala Ser Val Val Pro 
50 55 60 

Thr Gin Asn Thr Val lie Glu Arg Phe Ser Gin Lys Tyr Phe His lie 
65 70 75 80 

Ser Pro lie Trp Val Lys Ala Lys Asn Gly Cys Val Lys Trp Asn Val 

85 90 95 

Lys Asn Pro Ser Glu Val Gly Ala Asp Arg Val Ala Asn Val Val Ala 

100 105 110 

Phe Val Lys Glu Tyr Gly Lys Asn Gly lie lie lie Asp Met Gly Thr 
115 120 125 

Ala Thr Thr Val Asp Leu Val Val Asn Gly Ser Tyr Glu Gly Gly Ala 
130 135 140 

lie Leu Pro Gly Phe Phe Met Met Val His Ser Leu Phe Arg Gly Thr 
145 150 155 160 

Ala Lys Leu Pro Leu Val Glu Val Lys Pro Ala Asp Phe Val Val Gly 



1 



1 
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165 

i 



17Q 



175 



i ■ i * ' | 

Lys Asp. Thr Glu Glu Asn lie Arg Leu Gly Val. Val Asn Gly ( Ser Val 



180 



' 185 



190 



Tyr Ala Leu Glu Gly lie lie Gly Arg lie Lys Glu Val Tyr Gly Asp 
195 200 ' 205 



Leu Pro Val Val Leu Thr Gly Gly Gin Ser Lys lie 1 Val Lys Asp Met 

210 215 • '• • . 220 



lie Lys His ; Glu lie Phe Asp Glu Asp Leu Thr lie Lys Gly' Val Tyr 
225 ' 1 230 £3$ 1 240 



His Phe Cys Phe Gly Asp 

' 245 



<210>.l^ ., 
<211> 273 
<212> PRT 

<213> Treponema pallidum 
<400> 13 

Met Leu Leu lie Asp Val Gly Asn Ser His Val Val Phe Gly lie Gin 
1 5 10 15 

Gly Glu Asn Gly Gly Arg Val Cys Val Arg Glu Leu Phe Arg Leu Ala 

20 25 30 

i i 

1 1 

Pro Asp Ala Arg Lys Thr Gin Asp Glu Tyr Ser Leu Leu lie His Ala 
35 40 45 

Leu Cys Glu Arg Ala Gly Val Gly Arg Ala Ser Leu Arg Asp Ala Phe 
50 55 60 

lie Ser Ser Val Val Pro Val Leu Thr Lys Thr He Ala Asp Ala Val 
65 70 75 80 

Ala Gin He Ser Gly Val Gin Pro Val Val Phe Gly Pro Trp Ala Tyr 

85 90 95 

Glu His Leu Pro Val Arg He Pro Glu Pro Val Arg Ala Glu He Gly 

100 105 110 

Thr Asp Leu Val Ala Asn Ala Val Ala Ala Tyr Val His Phe Arg Ser 
115 120 125 

Ala Cys Val Val Val Asp Cys Gly Thr Ala Leu Thr Phe Thr Ala Val 
130 135 140 

Asp Gly Thr Gly Leu He Gin Gly Val Ala He Ala Pro Gly Leu Arg 
145 150 155 160 

Thr Ala Val Gin Ser Leu His Thr Gly Thr Ala Gin Leu Pro Leu Val 

165 170 175 



Pro Leu Ala Leu Pro Asp Ser Val Leu Gly Lys Asp Thr Thr His Ala 
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180 



- 15 - 

185 



190 



t i ■ 



Val'Gln Ala Gly Val Val Arg Gly Thr Leu Phe Val' lie Arg Ala , Met 

195 200 , ' ■ , ' 205 

* . t 1 

i 

lie Ala Gin Cys Gin Lys Glu Leu Gly Cys Arg Cys Ala Ala Val lie 
210 215 220 

Thr Gly Gly Leu Ser Arg Leu Phe Ser Ser Glu. Val Asp Phe Pro Pro 



225 



230 



lie Asp Ala Gin Leu Thr Leu 

245 



235 



,240 



Ser Gly Leu Ala His He Ala Arg Leu 
250 . 255 



Val Pro Thr Ser Leu Leu Pro; Pro Ala Thr Val Ser Gly Ser Ser I Gly 

260 ! 265 270 

Asn 



<210> 14 
<211> 262 j ' 

<212> PRT ! 1 

<213> Borrelia burgdorferi 

<400> 14 

Met Asn Lys Pro Leu Leu Ser Glu Leu He He Asp lie Gly Asn Thr 
15 10 15 

Ser He Ala Phe Ala Leu Phe Lys Asp Asn Gin Val Asn Leu Phe He 

20 25 30 

Lys Met Lys Thr Asn Leu Met Leu Arg Tyr Asp Glu Val Tyr Ser Phe 
35 40 45 

Phe Glu Glu Asn Phe Asp Phe Asn Val Asn Lys Val Phe He Ser Ser 
50 55 60 

Val Val Pro He Leu Asn Glu Thr Phe Lys Asn Val He Phe Ser Phe 
65 70 75 80 

Phe Lys He Lys Pro Leu Phe He Gly Phe Asp Leu Asn Tyr Asp Leu 

85 90 95 

Thr Phe Asn Pro Tyr Lys Ser Asp Lys Phe Leu Leu Gly Ser Asp Val 

100 105 110 

Phe Ala Asn Leu Val Ala Ala He Glu Asn Tyr Ser Phe Glu Asn Val 
115 120 125 

Leu Val Val Asp Leu Gly Thr Ala Cys Thr He Phe Ala Val Ser Arg 
130 135 140 

Gin Asp Gly He Leu Gly Gly He He Asn Ser Gly Pro Leu lie Asn 
145 150 155 160 



Phe Asn Ser Leu Leu Asp Asn Ala Tyr Leu He Lys Lys Phe Pro lie 
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i 



165 •. , . ,17p t .175 

i ' . ' ' ' ' , : 



. ! 



Ser Thr Pro Asn Asn Leu Leu Glu Arg Thr Thr. Ser .Gl,y Ser, Val Asn 

"l80 ' 185 ■ ,' ■ 190 ' 

i 

Ser Gly Leu Phe Tyr Gin Tyr Lys Tyr Leu lie Glu Gly Val Tyr Arg 

195 200 1 205 



i ' 1 



Asp lie Lys Gin Met Tyr Lys Lys Lys Phe Asn Leu lie lie Thr Gly 

210 215 ' '220 

\ • i ■ ■ 

Gly Asn Ala Asp Leu lie Leu Ser Leu Tie Glu lie Glu Phe' lie Phe 
225 ' 1 230 235 240 

i + * 

Asn lie His Leu Thr Val Glu Gly Val Arg. lie Leu Gly Asn Ser lie 

245 i 250. 255 

i 

Asp Phe Lys Phe Val Asn 

,260 , ' 



<210> 15 ' 
<211> 229 
<212> PRT 

<213> Aquifex aeolicus 

i 

t 

<400> 15 

Met Arg Phe Leu Thr Val Asp Val Gly Asn Ser Ser Val Asp lie Ala 
15 10 15 

LeuTrp Glu Gly Lys Lys Val Lys Asp Phe Leu Lys Leu Ser His Glu 

20 25 30 

Glu Phe Leu Lys Glu Glu Phe Pro Lys Leu Lys Ala Leu Gly lie Ser 
35 40 45 

Val Lys Gin Ser Phe Ser Glu Lys Val Arg Gly Lys lie Pro Lys lie 
50 55 60 

Lys Phe Leu Lys Lys Glu Asn Phe Pro lie Gin Val Asp Tyr Lys Thr 
65 70 75 80 

Pro Glu Thr Leu Gly Thr Asp Arg Val Ala Leu Ala Tyr Ser Ala Lys 

85 90 95 

Lys Phe Tyr Gly Lys Asn Val Val Val lie Ser Ala Gly Thr Ala Leu 

100 105 110 

Val lie Asp Leu Val Leu Glu Gly Lys Phe Lys Gly Gly Phe He Thr 
115 120 125 

Leu Gly Leu Gly Lys Lys Leu Lys He Leu Ser Asp Leu Ala Glu Gly 
130 135 140 

He Pro Glu Phe Phe Pro Glu Glu Val Glu lie Phe Leu Gly Arg Ser 
145 150 155 160 

Thr Arg Glu Cys Val Leu Gly Gly Ala Tyr Arg Glu Ser Thr Glu Phe 
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i * * * 

165 170 175 

i t 

. * i 

lie Lys Ser Thr Leu Lys Leu Trp Arg Lys Val Phe* Lys Arg Lys ( Phe 

180 ,185 * ' 190 - 

Lys Val Val lie Thr Gly Gly Glu Gly Lys Tyr Phe Ser Lys Phe Gly 
195 ! 200 205 

lie Tyr Asp Pro Leu Leu Val His Arg Gly. Met, Arg Asn Leu Leu Tyr 
210 215! ■ 220 

i 

i 

Leu Tyr His Arg lie 
225 



<210> 16 \ 
<211> 257 
<212> PRT 

<213> Synechocystis sp. 
<400> 16 

Met Glu Thr Ser Lys Pro Glyl Cys Gly Leu Ala Leu Asp Asn Asp Lys 
1 5 i 1 10 15 

i 

Gin Lys Pro Trp Leu Gly Leu Met lie Gly Asn Ser Arg Leu His Trp 

20 25 30 

Ala Tyr Cys Ser Gly Asn Ala Pro Leu Gin Thr Trp Val Thr Asp Tyr 
35 40 45 

Asn Pro Lys Ser Ala Gin Leu Pro Val Leu Leu Gly Lys Val Pro Leu 
50 55 60 

Met Leu Ala Ser Val Val Pro Glu Gin Thr Glu Val Trp Arg Val Tyr 
65 70 75 80 

Gin Pro Lys lie Leu Thr Leu Lys Asn Leu Pro Leu Val Asn Leu Tyr 

85 90 95 

Pro Ser Phe Gly lie Asp Arg Ala Leu Ala Gly Leu Gly Thr Gly Leu 

100 105 110 

Thr Tyr Gly Phe Pro Cys Leu Val Val Asp Gly Gly Thr Ala Leu Thr 
115 120 125 

lie Thr Gly Phe Asp Gin Asp Lys Lys Leu Val Gly Gly Ala lie Leu 
130 135 140 

Pro Gly Leu Gly Leu Gin Leu Ala Thr Leu Gly Asp Arg Leu Ala Ala 
145 150 155 160 

Leu Pro Lys Leu Glu Met Asp Gin Leu Thr Glu Leu Pro Asp Arg Trp 

165 170 175 

Ala Leu Asp Thr Pro Ser Ala lie Phe Ser Gly Val Val Tyr Gly Val 

180 185 190 

Leu Gly Ala Leu Gin Ser Tyr Leu Gin Asp Trp Gin Lys Leu Phe Pro 
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195 



200 



205 



Gly Ala Ala Met Val He Thr Gly Gly Asp ( Gly Lys' He Leu His Gly 
210 , 215 220 ■ ' 

Phe Leu Lys Glu His Ser Pro, Asn Leu Ser Val Ala Trp Asp Asp Asn 
225 230 ,235 240 



Leu He Phe Leu Gly Met Ala Ala He His His Gly Asp Arg Pro He 



245 



t Cys 



250 * 



255 ' 



<210> 17 
<211> 223 
<212> PRT 

<213> Helicobacter pylori 



<rdon> 1? 



Met Pro Ala Arg Gin Ser Phe ! Thr Asp Leu Lys Asn Leu Val Leu Cys 



10 



15 



Asp He Gly Asn Thr Arg He His Phe Ala Gin Asn Tyr Gin Leu Phe 

20 25 30 

Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu Gly He Gin Lys Glu He 
35 40 45 

Phe Tyr He Ser Val Asn Glu Glu Asn Glu Lys Ala Leu Leu Asn Cys 
50 1 55 60 

Tyr Pro Asn Ala Lys Asn He Ala Gly Phe Phe His Leu Glu Thr Asp 
65 70 75 80 

Tyr Val Gly Leu Gly He Asp Arg Gin Met Ala Cys Leu Ala Val Asn 

85 90 95 

Asn Gly Val Val Val Asp Ala Gly Ser Ala He Thr He Asp Leu He 

100 105 110 

Lys Glu Gly Lys His Leu Gly Gly Cys He Leu Pro Gly Leu Ala Gin 
115 120 125 

Tyr He His Ala Tyr Lys Lys Ser Ala Lys He Leu Glu Gin Pro Phe 
130 135 140 

Lys Ala Leu Asp Ser Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala 
145 150 155 160 

Val Asn Tyr Gly Met Val Leu Ser Val lie Ala Cys He Gin His Leu 

165 170 175 

Ala Lys Asn Gin Lys lie Tyr Leu Cys Gly Gly Asp Ala Lys Tyr Leu 

180 185 190 



Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp 
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195 200. 205' 



i i 



Gly Met Glu lie 1 Ala Leu Lys Lys' Ala Gly lie Leu Glu Cys Lys 
210 215 , 220 , 



<210> 18 

<211> 267 . ( 

<212> PRT 

<213> Bordetella pertussis 



> ii 



<400> 18 ' ' . 

Met He He Leu He Asp Ser Gly Asn Ser Arg Leu Lys Val Gly Trp 
1 5 10 15 

Phe Asp Pro Asp Ala Pro Gin Ala Ala Arg Glu Pro Ala Pro Val Ala 

20 25 .30 

i 

\ 

Phe Asp Asn Leu Asp Leu Asp Ala Leu Gly Arg Trp Leu Ala Thr Leu 
' 1 35 40 45 

Pro Arg Arg Pro Gin Arg Ala Leu. Gly Val Asn Val Ala Gly Leu Ala 
50 , 55 60 

i 

Arg Gly Glu Ala He Ala Ala Thr Leu Arg Ala Gly Gly Cys Asp He 
65 70 75 80 

Arg Trp Leu Arg Ala Gin Pro Leu Ala Met Gly Leu Arg Asn Gly Tyr 

85 90 95 

Arg Asn Pro Asp Gin Leu Gly Ala Asp Arg Trp Ala Cys Met Val Gly 

100 105 110 

Val Leu Ala Arg Gin Pro Ser Val His Pro Pro Leu Leu Val Ala Ser 
115 120 125 

Phe Gly Thr Ala Thr Thr Leu Asp Thr He Gly Pro Asp Asn Val Phe 
130 135 140 

Pro Gly Gly Leu He Leu Pro Gly Pro Ala Met Met Arg Gly Ala Leu 
145 150 155 160 

Ala Tyr Gly Thr Ala His Leu Pro Leu Ala Asp Gly Leu Val Ala Asp 

165 170 175 

Tyr Pro He Asp Thr His Gin Ala He Ala Ser Gly He Ala Ala Ala 

180 185 190 

Gin Ala Gly Ala He Val Arg Gin Trp Leu Ala Gly Arg Gin Arg Tyr 
195 200 205 

Gly Gin Ala Pro Glu He Tyr Val Ala Gly Gly Gly Trp Pro Glu Val 
210 215 220 

Arg Gin Glu Ala Glu Arg Leu Leu Ala Val Thr Gly Ala Ala Phe Gly 
225 230 235 240 

Ala Thr Pro Gin Pro Thr Tyr Leu Asp Ser Pro Val Leu Asp Gly Leu 
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245 



< < 



250 



255 



Ala Ala Leu Ala' Ala Gin Gly Ala Pro Thr Ala 

' l~ ~ * r- • 



260 



265 



<210> 19 

<211> 777 ' i 

<212> DNA 

<213> Bacillus subtilis 

i i i 

<220> ( ' 
<221> CDS 1 
<222> (1) . . (774), 



<400> 19 

ttg tta ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 

Leu Leu Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
1,5 10 , 15' 



48 



tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca age agg 
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 

20 25 30 



cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tec tta ttt gat 
His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 



144 



cac tec ggg ctt atg ttt gaa cag ata gat ggc att att att teg tea 
His Ser Gly Leu Met Phe Glu Gin He Asp Gly He He He Ser Ser 
50 55 60 



192 



gta gtg ccg cca ate atg ttt gcg tta gaa aga atg tgc aca aaa tac 
Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 



240 



ttt cat ate gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 
Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr. Gly Leu 

85 90 95 



288 



aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga ate gta 
Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 

100 105 110 



336 



aat get gtc get gcg ata cac ttg tac ggc aat cca tta att gtt gtc 
Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 . 125 



384 



gat ttc gga acc gec aca acg tac tgc tat att gat- gaa aac aaa caa 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 



432 



tac atg ggc ggg gcg att gee cct ggg att aca att teg aca gag gcg 
Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 



480 



ctt tac teg cgt gca gca aag ctt cct cgt ate gaa ate acc egg ccc 528 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 
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165 



170 



175 



gac aat att ate gga aaa aac act gtt age ,gcg atg caa tct gga'att 
Asp Asn He He Gly Lys Asn Thr Val Ser Ala Me,t Gin Ser Gly tile 

180 185 190 



576 



tta ttt ggc tat gtc ggc caa gtg gaa gga ( atc gtt aag cga atg aaa 624 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly lie Val 1 Lys Arg Met Lys 

195 ; 200 • " , 205 . 

i i 1 

tgg cag gca aaa cag gac ctc ( aag gtc att gcg aca gga ggc ctg gcg 672 

Trp Gin Ala Lys Gin Asp Leu Lys Val He Ala Thr Gly Gly Leu Ala 



210 



215 



220 



ccg etc att gcg aac gaa tea; gat tgt ata gac ate gtt gat cca ttc 
Pro Leu He Ala Asn Glu Ser^ Asp Cys He Asp He Val Asp Pro Phe 
225 230 235 240 



720 



tta ace eta aaa ggg ctg gaa ttg att tat gaa aga aac cgc gta gga 
Leu Thr Leu Lys Gly Leu' Glu Leu lie Tyr Glu Arg Asn Arg Val Gly 

245 : 250 255 



768 



agt gta tag 
Ser Val. 



777 



<210> 20 
<211> 960 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (957) 

<400> 20 

gtg aaa aat aaa gaa ctt aac eta cat act tta tat aca cag cac aat 48 

Met Lys Asn Lys Glu Leu Asn Leu His Thr Leu Tyr Thr Gin His Asn 

15 10 15 

egg gag tct tgg tct ggt ttt ggg ggg cat ttg teg att get gta tct 96 
Arg Glu Ser Trp Ser Gly Phe Gly Gly His Leu Ser He Ala Val Ser 

20 25 30 

gaa gaa gag gca aaa get gtg gaa gga ttg aat gat tat eta tct gtt 144 
Glu Glu Glu Ala Lys Ala Val Glu Gly Leu Asn Asp Tyr Leu Ser Val 
35 40 45 

gaa gaa gtg gag acg ate tat att ccg ctt gtt cgc ttg ctt cat tta 192 
Glu Glu Val Glu Thr He Tyr He Pro Leu Val Arg Leu Leu His Leu 
50 55 60 

cat gtc aag tct gcg get gaa cgc aat aag cat gtc aat gtt ttt ttg 240 
His Val Lys Ser Ala Ala Glu Arg Asn Lys His Val Asn Val Phe Leu 
65 70 75 80 



aag cac cca cat tea gee aaa att ccg ttt att ate ggc att gec ggc 
Lys His Pro His Ser Ala Lys He Pro Phe He He Gly He Ala Gly 



288 
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85 



90 



95, 



agt gtc gca gtc gga aaa 
Ser Val Ala Val Gly Lys 

100 



age acg acg gcg egg, ate ttg cag aag 'ctg 
Ser Thr Thr Ala Arg lie Leu Gin Lys 'Leu 
105 110 



336 



ctt 
Leu 



teg 
Ser 



ggt ttt 
Gly Phe 
130 

aga aaa 
Arg Lys 
145 



cgt ttg 
Arg Leu 
115 



tta 
Leu 



gga 
Gly 



ttt 
Phe 



ttt 
Phe 



cct gac cgt 
Pro Asp Arg 



cct 
Pro 



cct 
Pro 



act 
Thr 



gaa 
Glu 
150 



cca 
Pro 
120 

gec gag 
Alai Giu 
135 



age 
Ser 



tat 
Tyr 



aaa 
Lys 



ctg 
Leu 



gat 
Asp 



gtg 
Val 



aaa 
Lys 



gta 
Val 



age 
Ser 



aag 
Lys 



aag 
Lys 
155 



ctt 
Leu 



aaa 
Lys 
140 

gcg 
Ala 



ate 
He 
125 

i 

aat 
Asn 



ctg 
Leu 



acg aca gat 
Thr Tnr Asp 



at'g atg tea 
Met Met Ser 



384 



etc 
Leu 



I 



i-' 



gaa ttt 
Glu Phe 
160 



4 32 



480 



ttg aat gac tta aaa tea gga aag gac age gta aag gec ccg gtg tat 
Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser Val Lys Ala Pro Val Tyr 



165 



170 



tec cat eta acc tat gac cgc gag gaa ggt gtg ttc 
Ser His Leu Thr Tyr Asp Arg Glu Glu Gly Val Phe 

180 185 



gag 
Glu 



gtt 
Val 
190 



gta 
Val 



gaa 
Glu 



528 



576 



cag 
Gin 



gcg 
Ala 



gat 
Asp 
195 



att 
He 



gtg 
Val 



att 
He 



att 
He 



gaa 
Glu 
200 



ggc 
Gly 



att 
He 



aat 
Asn 



gtt 
Val 



ctt 
Leu 
205 



cag 
Gin 



teg 
Ser 



ccc 
Pro 



624 



acc ttg gag gat gac egg gaa aac ccg cgt att ttt gtt tec gat ttc 
Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg He Phe Val Ser Asp Phe 
210 215 220 



672 



ttt gat ttt teg 
Phe Asp Phe Ser 
225 



att tat gtg gat gcg gag gaa age egg att ttc act 
He Tyr Val Asp Ala Glu Glu Ser Arg He Phe Thr 
230 235 240 



720 



tgg tat tta gag cgt ttt cgc ctg ctt egg gaa aca get ttt 
Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg Glu Thr Ala Phe 

245 250 



caa aat 
Gin Asn 
255 



768 



cct 
Pro 



gat 
Asp 



tea 
Ser 



tat 
Tyr 
260 



ttt 
Phe 



cat 
His 



aaa 
Lys 



ttt 
Phe 



aaa 
Lys 
265 



gac 
Asp 



ttg 
Leu 



tec 
Ser 



gat 
Asp 



cag 
Gin 
270 



gag 
Glu 



get 
Ala 



816 



gac 
Asp 



gag 
Glu 



atg gca gec teg att tgg gag agt gtc aac egg ccg aat tta 
Met Ala Ala Ser He Trp Glu Ser Val Asn Arg Pro Asn Leu 
275 280 285 



864 



tat 
Tyr 



gaa 
Glu 
290 



aat 
Asn 



att 
He 



ttg 
Leu 



cca 
Pro 



act 
Thr 
295 



aaa 
Lys 



ttc 
Phe 



agg 
Arg 



tea 
Ser 



gat 
Asp 
300 



etc att ttg cgt 
Leu He Leu Arg 



912 



aag gga gac ggg cat aag gtc gag gaa gtg ttg gta agg agg gta 
Lys Gly Asp Gly His Lys Val Glu Glu Val Leu Val Arg Arg Val 
305 310 315 



tga 



960 
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<210> 21 . ■ i 

<211> 882 

<212> DNA 1 

<213> Bacillus subtilis 



<220> 
<221> CDS 
<222> (1) 



(879) 



<400> 21 ( ■ 1 ' 

ttg teg att get gta tct gaa gaa gag gca aaa get gtg gaa- gga ttg 
Met Ser lie Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu 

1 5 10 . ' ' 15 



48 



aat gat tat eta 1 tct gtt gaa gaa gtg gag acg ate tat att ccg ctt 
Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr lie Tyr lie Pro Leu 

20 25 30 



96 



gtt cgc ttg ctt cat tta cat gtc aag tct gcg get gaa cgc aat aag 
Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala Glu Arg Asn Lys 
35 40 45 1 



144 



cat gtc aat gtt ttt ttg aag cac cca cat tea gee aaa att ccg ttt 
His Val Asn Val Phe Leu Lys His Pro His Ser Ala Lys lie Pro Phe 
50 55 60 



192 



a tt ate ggc att gee ggc agt gtc gca gtc gga aaa age acg acg gcg 
lie lie Gly lie Ala Gly Ser Val Ala Val Gly Lys Ser Thr Thr Ala 
65 70 75 80 



240 



egg ate ttg cag aag ctg ctt teg cgt ttg cct gac cgt cca aaa gtg 
Arg lie Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val 

85 90 95 



288 



age ctt ate acg aca gat ggt ttt tta ttt cct act gee gag ctg aaa 
Ser Leu lie Thr Thr Asp Gly Phe Leu Phe Pro Thr Ala Glu Leu Lys 

100 105 110 



336 



aag aaa aat atg atg tea aga aaa gga ttt cct gaa age tat gat gta 
Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val 
115 120 125 



384 



aag gcg ctg etc gaa ttt ttg aat gac tta aaa tea gga aag gac age 
Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser 
130 135 140 



432 



gta aag gee ccg gtg tat tec cat eta ace tat gac cgc gag gaa ggt 
Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly 
145 150 155 160 



480 



gtg ttc gag gtt gta gaa cag gcg gat att gtg att att gaa ggc att 
Val Phe Glu Val Val Glu Gin Ala Asp lie Val He He Glu Gly He 

165 170 175 



528 



aat gtt ctt cag teg ccc ace ttg gag gat gac egg gaa aac ccg cgt 
Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg 

180 185 190 



576 



WO 01/21772 



PCT/US00/25993 



- 24 - 



att 
He 



ttt 
Phe 



gtt 
Val 
195 



tec 
Ser 



gat 
Asp 



ttc 
Phe 



ttt 
Phe 



gat 
Asp 
200 



ttt 
Phe 



teg 
Ser 



att 
.He 



tat 
Tyr 



g.tg 
Val 
205 



gat 
Asp 



Ala 



gag 
Glu 



i 624 



gaa age egg att ttc 
Glu Ser Arg He Phe 
210 



act tgg, tat tta gag cgt ttt cgc ctg ctt egg 
Thr Trp Tyr Leu Glu ,Arg Phe Arg Leu Leu Arg 
215 - ■ * , 220' 



672 



gaa aca get ttt caa aat cct 
Glu Thr Ala Phe Gin Asn Pro 
225 230 



gat 
Asp 



tea 
Ser 



tat 
Tyr 



ttt 
Phe 
235 



ttg tec gat cag gag get gac gag atg gca gee 
Leu Ser Asp Gin Glu Ala Asp. Glu Met Ala Ala 

245 • 250 



cat 
His 



teg 
Ser 



aaa 
Lys 



att 
He 



ttt 
Phe 



tgg 
Trp 



aaa gac, 
Lys Asp 
240 



720 



gag 
Glu 
255 



agt 
Ser 



7 68 



gtc 
Val 



aac 
Asn 



egg 
Arg 



ccg 
Pro 
260 



aat 
Asn 



tta 
Leu 



tat 
Tyr 



gaa 
Glu 



aat 
Asn 
265 



att 
He 



ttg 
Leu 



cca 
Pro 



act 
Thr 



aaa 
Lys 
270 



ttc 
Phe 



agg 
Arg 



816 



tea gat etc att ttg cgt aag! gga gac ggg cat aag gtc gag gaa gtg 
Ser Asp Leu He Leu Arg Lys , Gly Asp Gly His Lys Val Glu Glu Val 
275 280 285 



864 



ttg gta agg agg gta 
Leu Val Arg Arg Val 
290 



tga 



882 



<210> 22 
<211> 846 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (843) 

<400> 22 

gtg gaa gga ttg aat gat tat eta tct gtt gaa gaa gtg gag acg ate 48 
Met Glu Gly Leu Asn Asp Tyr Leu Ser Val Glu Glu Val Glu Thr lie 
15 10 15 

tat att ccg ctt gtt cgc ttg ctt cat tta cat gtc aag tct gcg get 96 
Tyr He Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala 

20 25 30 

gaa cgc aat aag cat gtc aat gtt ttt ttg aag cac cca cat tea gee 144 
Glu Arg Asn Lys His Val Asn Val Phe Leu Lys His Pro His Ser Ala 
35 4 0 4 5 

aaa att ccg ttt att ate ggc att gec ggc agt gtc gca gtc gga aaa 192 
Lys He Pro Phe He He Gly He Ala Gly Ser Val Ala Val Gly Lys 
50 55 60 

age acg acg gcg egg ate ttg cag aag ctg ctt teg cgt ttg cct gac 240 
Ser Thr Thr Ala Arg He Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp 
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1 

65 , , 70 , , '75 ■ ' 80 

cgt cca aaa gtg 1 age ctt ate acg aca gat g^t ttt tta ttt cct act 288 

Arg Pro Lys'Val Ser Leu lie Thr Thr Asp Gly Phe Leu Phe Pro Thr 

'85 90 95 

i 

gec gag ctg aaa aag aaa aat atg atg tea aga aaa gga ttt cct gaa 33,6 

Ala Glu Leu Lys Lys ilys Asn Met Met Ser Arg Lys Gly Phe Pro Glu 

100 ' 105 , , ' " '.' ' 1.10 

age tat^at gta aag gcg ctg cite 'gaa ttt ttg aat gac tta aaa tea 384 

Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser 

1 115 i 120 ' < 125 

» ■ . ( • < 

gga aag gac age gta aag gee ccg gtg t'at , tec cat eta acc tat gac 432 

Gly Lys Asp Ser Val Lys Ala Pro V^l Tyr Ser His Leu Thr Tyr Asp 

130 135 140, 

i 

' 1 - 

cgc gag gaa ggt gtg ttc gag gtt gta gaa cag gcg ( gat att gtg att 480 

Arg Glu 1 Glu Gly Val Phe Glu Val Val Glu Gin Ala Asp lie Val lie 

145 150 155 160 

> < i 

att gaa ggc att aat gtt ctt cag teg ccc acc ttg gag gat gac egg 528 

lie Glu Gly lie Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg 

165 170 , 175 

gaa aac ccg cgt att ttt , gtt tec gat ttc ttt gat ttt teg att tat 576 

Glu Asn Pro Arg lie Phe Val Ser Asp Phe Phe Asp Phe Ser lie Tyr 

180 185 190 

gtg gat gcg gag gaa age egg att ttc act tgg tat tta gag cgt ttt 624 

Val Asp Ala Glu Glu Ser Arg lie Phe Thr Trp Tyr Leu Glu Arg Phe 

195 200 205 

i 

cgc ctg ctt egg gaa aca get ttt caa aat cct gat tea tat ttt cat 672 

Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His 

210 215 220 

aaa ttt aaa gac ttg tec gat cag gag get gac gag atg gca gee teg 720 

Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser 

225 230 235 240 

att tgg gag agt gtc aac egg ccg aat tta tat gaa aat att ttg cca 768 

lie Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn lie Leu Pro 

245 250 255 

act aaa ttc agg tea gat etc att ttg cgt aag gga gac ggg cat aag 816 
Thr Lys Phe Arg Ser Asp Leu lie Leu Arg Lys Gly Asp Gly His Lys 

260 265 270 

gtc gag gaa gtg ttg gta agg agg gta tga 846 
Val Glu Glu Val Leu Val Arg Arg Val 
275 280 



<210> 23 
<211> 831 
<212> DNA 
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ii' » 

<213> Bacillus i subtilis . ' 

• i 

<220> 

<221> CDS ' ' , 1 . ■ 

<222> (1) . . (831? 

<4oo> 23 , 

atg aaa aca aaa ctg gat ttt eta aaa atg aag gag tct ,gaa gaa ccg 48 

Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 

1 5 , ' 10 ' 15 

i ) 1 i 

att gt t c atg , ctg a'cc get tat gat tat 'ccg gca get aaa ctt' get gaa 96 

lie Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 

20 . 25 30 

caa gcg gga gtt gac atg att tta gtc ggt. gat tea ctt gga atg gtc 144 
Gin Ala Gly Val Asp Met lie Leu Val Gly Asp Ser Leu Gly Met Val 
35 40 ' 45 



i 



gtc etc ggc ctt gat tea act gtc ggt gtg aca gtt gcg gac atg ate 192 
Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met He 
50 ' 55 60 . 

cat cat aca aaa gec gtt aaa agg ggt gcg ccg aat acc ttt att gtg 240 
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe He Val 
65 70 75 80 

aca gat atg ccg ttt atg tct tat cac ctg tct aag gaa gat acg ctg 288 
Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 

85 90 95 

aaa aat gca gcg get ate gtt cag gaa age gga get gac gca ctg aag 336 
Lys Asn Ala Ala Ala He Val Gin Glu Ser Gly Ala Asp Ala Leu Lys 

100 105 110 

ctt gag ggc gga gaa ggc gtg ttt gaa tec att cgc gca ttg acg ctt 384 
Leu Glu Gly Gly Glu Gly Val Phe Glu Ser He Arg Ala Leu Thr Leu 
115 120 125 

gga ggc att cca gta gtc agt cac tta ggt ttg aca ccg cag tea gtc 432 
Gly Gly He Pro Val Val Ser His Leu Gly Leu Thr Pro Gin Ser Val 
130 135 140 

ggc gta ctg ggc ggc tat aaa gta cag ggc aaa gac gaa caa age gee 480 
Gly Val Leu Gly Gly Tyr Lys Val Gin Gly Lys Asp Glu Gin Ser Ala 
145 150 155 160 

aaa aaa tta ata gaa gac agt ata aaa tgc gaa gaa gca gga get atg 528 
Lys Lys Leu lie Glu Asp Ser He Lys Cys Glu Glu Ala Gly Ala Met 

165 170 175 

atg ctt gtg ctg gaa tgt gtg ccg gca gaa etc aca gec aaa att gec 576 
Met Leu Val Leu Glu Cys Val Pro Ala Glu Leu Thr Ala Lys lie Ala 

180 185 190 

gag acg eta age ata ccg gtc att gga ate ggg get ggt gtg aaa gcg 624 
Glu Thr Leu Ser lie Pro Val lie Gly He Gly Ala Gly Val Lys Ala 
195 200 205 
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'gac gga caa gtt etc gtt tat cat gat att ! atc ggc cac ggt gtt ' gag , 672 

Asp Gly Gin Val Leu Val Tyr His Asp He He Gly' His Gly Val ,Glu 
210 215 ' ' 220 ■ , ■ 

aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa acc ate 720 

Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg He Asp Glu Thr. He 

225 230 , 235 . . " . 240 

t . 

gaa aca gca ate age gga tat! gtt cag gat gta aga cat cgt get tt,c 768 

Glu Thr Ala lie Ser Gly Tyr! Val Gin Asp Val Arg His 1 Arg Ala Phe' » 

245 j 250 255 

1 ! ' 

cct gaa caa aag cat tec ttt caa atg aac cag aca gtg ctt gac ggc 816 

Pro Glu Gin Lys His Ser Phe Gin Met Asn Gin Thr Val Leu Asp ] Qly 

260 265 270 

ttg tac ggg gga aaa 831 
Leu Tyr Gly Gly Lys 
275 



i 

: 1 



<210> 24 ; 
<211> 277 
<212> PRT 

<213> Bacillus subtilis 
<400> 24 

Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 
1 5 10 15 

He Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala Ala Lys Leu Ala Glu 

20 25 30 

Gin Ala Gly Val Asp Met He Leu Val Gly Asp Ser Leu Gly Met Val 
35 40 45 

Val Leu Gly Leu Asp Ser Thr Val Gly Val Thr Val Ala Asp Met He 
50 55 60 

His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe He Val 
65 70 75 80 

Thr Asp Met Pro Phe Met Ser Tyr His Leu Ser Lys Glu Asp Thr Leu 

85 90 95 

Lys Asn Ala Ala Ala He Val. Gin Glu Ser Gly Ala Asp Ala Leu Lys 

100 105 110 

Leu Glu Gly Gly Glu Gly Val Phe Glu Ser He Arg Ala Leu Thr Leu 
115 } 120 125 

Gly Gly He Pro Val Val Ser His Leu Gly Leu Thr Pro Gin Ser Val 
130 135 140 



Gly Val Leu Gly Gly Tyr Lys Val Gin Gly Lys Asp Glu Gin Ser Ala 
145 150 155 160 
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Lys Lys Leu He. Glu Asp Ser lie Lys Cys Glu Glu Ala Gly Ala^Met, 

165 ' 170 ' ' ' 175 



Met Leu Val Leu Glu Cys. Val Pro Ala Glu Leu Thr Ala Lys lie Ala 

180 185 ' 190 

Glu Thr Leu Ser He Pro Val He Gly lie Gly Ala Gly Val Lys Ala 

195 200 ' '205 



,Asp Gly Gin Val Leu Val Tyr 
210 ' 215 



His Asp He lie Gly His , Gly Val GlU 

220 



Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg He Asp Glu Thr He 
225 230 235 ,240 



r * 



Glu Thr Ala He Ser Gly Tyr Val Gin Asp Val Arg His Arg Ala Phe 

245 250 255 

: t 

Pro Glu Gin Lys His Ser Phe Gin Met Asn Gin Thr Val Leu Asp Gly 

260 ' 265 270 

i 

Leu Tyr Gly Gly Lys ! t 

275 i 



<210> 


25 






























<211> 


85 


8 




























<212> 


DNA 




























<213> 


Bacillus subtilis 






















<220> 
































<221> 


CDS 




























<222> 


(1) . . 


(858) 


























<400> 


25 






























atg aga 


cag 


att 


act 


gat 


att 


tea 


cag 


ctg 


aaa 


gaa 


gec 


ata 


aaa 


caa 


Met Arg 


Gin 


He 


Thr 


Asp 


He 


Ser 


Gin 


Leu 


Lys 


Glu 


Ala 


He 


Lys 


Gin 


1 








5 










10 














tac cat 


tea 


gag 


ggc 


aag 


tea 


ate 


gga 


ttt 


gtt 


ccg 


acg 


atg 


ggg 


ttt 


Tyr His 


Ser 


Glu 


Gly 


Lys 


Ser 


He 


Gly 


Phe 


Val 


Pro 


Thr 


Met 


Gly 


Phe 








20 










25 










30 






ctg cat 


gag 


ggg 


cat 


tta 


acc 


tta 


gca 


gac 


aaa 


gca 


aga 


caa 


gaa 


aac 


Leu His 


Glu 


Gly 


His 


Leu 


Thr 


Leu 


Ala 


Asp 


Lys 


Ala 


Arg 


Gin 


Glu 


Asn 






35 










40 










45 









48 



96 



144 



gac gec gtt att atg agt att ttt gtg aat cct gca caa ttc ggc cct 
Asp Ala Val He Met Ser He Phe Val Asn Pro Ala Gin Phe Gly Pro 
50 55 60 



192 



aat gaa gat ttt gaa gca tat ccg cgc gat att gag egg gat gca get 
Asn Glu Asp Phe Glu Ala Tyr Pro Arg Asp He Glu Arg Asp Ala Ala 
65 70 75 80 



240 



ctt gca gaa aac gec gga gtc gat att ctt ttt acg cca gat get cat 
Leu Ala Glu Asn Ala Gly Val Asp He Leu Phe Thr Pro Asp Ala His 

85 90 95 



288 
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gat atg tat ccc ggt gaa, aag aat gtc acg att cat gt,a gaa. aga cgc 
Asp Met Tyr Pro' Gly Glu Lys Asn Val Thr lie His Val Glu Arg Arg 

TOO 1 105 ' . ■ ' 110 ' • 



336 



aca gac gtg tta tgc ggg cgc tea aga gaa gga cat ttt gac ggg gtc 
Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val 

115 , .' 120 i ' ' 125 



384 



gcg ate gta ctg acg aag ctt ttc aat- eta gtc aag ccg act cgt gee 
A}a lie, Val Leu Thr Lys Leu Phe'Asn Leu Val Lys Pro Thr Arg Ala 
130 1 135 1 . 140 



432 



tat ttc ggt tta aaa gat gcg cag cag gta get gtt gtt gat ggg tta 
Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val, Ala Val Val Asp Gly Leu 
145 150 , , 155 160 



480 



ate age gac ttc ttc atg gat att gaa ttg gtt cct gtc gat acg gtc 
He Ser Asp Phe Phe Met Asp He Glu Leu Val Pro , Val Asp Thr Val 
1 165 170 175 



528 



aga gag gaa gac ggc tta gee aaa age tct cgc aat gta tac' tta aca 
Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr 

180 185 190 



576 



get gag gaa aga aaa gaa gcg cct aag ctg tat egg gec ctt caa aca 
Ala Glu Glu Arg Lys Glu< Ala Pro Lys Leu Tyr Arg Ala Leu Gin Thr 
195 ■ 200 205 



624 



agt gcg gaa ctt gtc caa gee ggt gaa aga gat cct gaa gcg gtg ata 
Ser Ala Glu Leu Val Gin Ala Gly Glu Arg Asp Pro Glu Ala Val He 
210 215 220 



672 



aaa get gca aaa gat ate att gaa acg act age gga ace ata gac tat 
Lys Ala Ala Lys Asp He He Glu Thr Thr Ser Gly Thr He Asp Tyr 
225 230 235 240 



720 



gta gag ctt tat tec tat ccg gaa etc gag cct gtg aat gaa att get 
Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu He Ala 

245 250 255 



768 



gga aag atg att etc get gtt gca gtt get ttt tea aaa gcg cgt tta 
Gly Lys Met He Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu 

260 265 270 



816 



ata gat aat ate att att gat att cga gaa atg gag aga ata 
He Asp Asn He He He Asp He Arg Glu Met Glu Arg He 
275 280 285 



858 



<210> 26 
<211> 286 
<212> PRT 

<213>. Bacillus subtilis 



<400> 26 

Met Arg Gin He Thr Asp He Ser Gin Leu Lys Glu Ala He Lys Gin 
15 10 15 
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Tyr His Ser Glu Gly Lys , Ser He Gly Phe Val Pro Thr Met Gly Phe 

20 25 ' ' .30, 



Leu His Glu Gly 'His Leu Thr Leu Ala Asp Lys Ala Arg Gin Glu .Asn 
35 40 .45 

Asp Ala Val He Met Ser He Phe Val Asn Pro Ala Gin Phe Gly Pro 

50 . 55 . ; ' ' ".' . 60 , ' » 

Asn Glu, Asp Phe Glu Ala Tyr Pro Arg Asp He Glu Arg Asp Ala Ala 

65 ' ,70 '75 80 

i 

Leu Ala Glu -Asn Ala Gly Val Asp lie Leu Phe Thr Pro Asp Ala His, 

85 '90 ■ 95 

Asp Met Tyr Pro Gly Glu Lys Asn Val Thr He His Val Glu Arg Arg 

100 ' 105 ' 110 

Thr Asp Val Leu Cys Gly Arg Ser Arg Glu Gly His Phe Asp Gly Val 
115 120 125 

i 

Ala He Val Leu Thr Lys Leu Phe Asn Leu Val Lys Pro Thr Arg Ala 
130 135 140 

Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val Ala Val Val Asp Gly Leu 
145 150' 155 160 

He Ser Asp Phe Phe Met Asp He Glu Leu Val Pro Val Asp Thr Val 

165 170 175 

Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser Arg Asn Val Tyr Leu Thr 

180 185 190 

Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu Tyr Arg Ala Leu Gin Thr 
195 200 205 

Ser Ala Glu Leu Val Gin Ala Gly Glu Arg Asp Pro Glu Ala Val He 
210 215 220 

Lys Ala Ala Lys Asp He He Glu Thr Thr Ser Gly Thr He Asp Tyr 
225 230 235 240 

Val Glu Leu Tyr Ser Tyr Pro Glu Leu Glu Pro Val Asn Glu He Ala 

245 250 255 

Gly Lys Met He Leu Ala Val Ala Val Ala Phe Ser Lys Ala Arg Leu 

260 265 270 

He Asp Asn lie lie lie Asp lie Arg Glu Met Glu Arg lie 
275 280 285 



<210> 27 
<211> 381 
<212> DNA 

<213> Bacillus subtilis 



NSDOCID: <WO 0121772A2J_> 



WO 01/21772 



PCT/USOG/25993 



-31 - 



<220> 
<221> CDS 
<222> (1) 



(381) 



• , 1 . 



<400> 27 

atg tat cga aca atg 
Met Tyr Arg Thr Met 
1 5 



atg age ggc aaa ctt cac agg gca act gtt acg 
Met Ser Gly Lys Leu His Arg Ala Thr Val. Thr 

10 ' , 15 



48 



gaa gca aac ctg aac tat gtg gga age att aca att gat gaa gat ^c^c 96 
Glu Ala Asn Leu Asn Tyr Vail Gly Ser lie Thr lie Asp' Glu Asp Leu 

20 I 25 30 



att gat get gtg gga atg ctt cct aat gaa aaa gta caa att gtg aat 144 
lie Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gin lie Val 'Asn 
35 ! 40 45 



aat aat aat gga gca cgt ctt gaa acg tat att att cct ggt 
Asn Asn Asn Gly Ala Arg Leu : Glu Thr Tyr lie lie Pro Gly 
50 ' 55 60 



aaa egg 
Lys Arg 



192 



gga 
Gly 
65 



age 
Ser 



ggc 
Gly 



gtc 
Val 



ata 
He 



tgc 
Cys 
70 



tta| 
Leu ; 



aac 
Asn 



ggt 
Gly 



gca 
Ala 



gec 
Ala 
75 



gca 
Ala 



cgc 
Arg 



ctt 
Leu 



gtg 
Val 



cag 
Gin 
8b 



240 



gaa gga gat aag gtc att att att tec tac aaa atg atg tct 
Glu Gly Asp Lys Val He He He Ser Tyr Lys Met Met Ser 

85 90 



gat caa 
Asp Gin 
95 



288 



gaa 
Glu 



gcg 
Ala 



gca 
Ala 



age 
Ser 
100 



cat 
His 



gag 
Glu 



ccg 
Pro 



aaa 
Lys 



gtg 
Val 
105 



get 
Ala 



gtt 
Val 



ctg 
Leu 



aat 
Asn 



gat 
Asp 
110 



caa 
Gin 



aac 
Asn 



336 



aaa att 
Lys lie 



gaa 
Glu 
115 



caa 
Gin 



atg 
Met 



ctg 
Leu 



ggg 

Gly 



aac 
Asn 
120 



gaa 
Glu 



cca 
Pro 



gec 
Ala 



cgt 
Arg 



aca 
Thr 
125 



att 
He 



ttg 
Leu 



381 



<210> 28 
<211> 127 
<212> PRT 

<213> Bacillus subtilis 
<400> 28 

Met Tyr Arg Thr Met Met Ser Gly Lys Leu His Arg Ala Thr Val Thr 
15 10 15 

Glu Ala Asn Leu Asn Tyr Val Gly Ser He Thr He Asp Glu Asp Leu 

20 25 30 

He Asp Ala Val Gly Met Leu Pro Asn Glu Lys Val Gin He Val Asn 
35 40 45 

Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr lie lie Pro Gly Lys Arg 
50 55 60 

Gly Ser Gly Val lie Cys Leu Asn Gly Ala Ala Ala Arg Leu Val Gin 
65 70 75 80 
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Glu Gly Asp Lys f Val lie, lie Ile ( Ser Tyr Lys Met Met Ser> Asp Gin 

85 90 ' ' , , 95 

> i • , . 

i ■ ' 

Glu Ala Ala Ser 'His. Glu Pro Lys Val Ala Val Leu Asn Asp Gin Asn 

100 105 110 

i * i . . 

Lys lie Glu Gin Met Leu Gly Asn Glu Pro Ala Arg Thr lie Leu 

115 120 • ; ' '■ V 125 1 

h i 

. . . , 1 
1 i i • ■ 

i i ' . , 

<210> 29 1 
<211> 894 1 ' , 

<212> DNA 

<213> Bacillus subtilis 

<220> ; 
<221> CDS 

<222>. (1) . . (894) , ' ' 

<400> 29 

atg aaa att gga att ate ggc gga ggc tec gtt ggt ctt tta' tgc gee 48 
Met Lys lie Gly lie lie Gly Gly Gly Ser Val Gly Leu Leu Cys Ala 
1 5 10 15 

tat tat ttg tea ctt tat cac gac gtg act gtt gtg acg agg egg caa 96 
Tyr Tyr Leu Ser Leu Tyr" His Asp Val Thr Val Val Thr Arg Arg' Gin 

20 25 30 

gaa cag get gcg gee att cag tct gaa gga ate egg ctt tat aaa ggc 144 
Glu Gin Ala Ala Ala lie Gin Ser Glu Gly lie Arg Leu Tyr Lys Gly 
35 40 45 

ggg gag gaa ttc agg get gat tgc agt gcg gac acg agt ate aat teg 192 
Giy Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser lie Asn Ser 
50 55 60 

gac ttt gac ctg ctt gtc gtg aca gtg aag cag cat cag ctt caa tct 240 
Asp Phe Asp Leu Leu Val Val Thr Val Lys Gin His Gin Leu Gin Ser 
65 70 75 80 

gtt ttt teg teg ctt gaa cga ate ggg aag acg aat ata tta ttt ttg 288 
Val Phe Ser Ser Leu Glu Arg lie Gly Lys Thr Asn lie Leu Phe Leu 

85 90 95 

caa aac ggc atg ggg cat ate cac gac eta aaa gac tgg cac gtt ggc 336 
Gin Asn Gly Met Gly His lie His Asp Leu Lys Asp Trp His Val Gly 

100 105 110 

cat tec att tat gtt gga ate gtt gag cac gga get gta aga aaa teg 384 
His Ser lie Tyr Val Gly lie Val Glu His Gly Ala Val Arg Lys Ser 
115 120 125 

gat aca get gtt gat cat aca ggc eta ggt gcg ata aaa tgg age gcg 432 
Asp Thr Ala Val Asp His Thr Gly Leu Gly Ala lie Lys Trp Ser Ala 
130 135 140 

ttc gac gat get gaa cca gac egg ctg aac ate ttg ttt cag cat aac 480 
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Phe Asp Asp Ala Glu Pro Asp Arg 
145 150 



Leu ,Asn lie Leu Phe Gin His Asn 
155 ' ■ 1 '160' 



cat teg gat ttt ccg att tat tat 
His Ser Asp Phe Pro lie Tyr Tyr 

165 



gag acg gat tgcf tac cgt ctg t ctg 
Glu Thr Asp Trp Tyr Arg Leu Leu 
170 175 



528 



acg ggc aag ctg att gta aat ; gcg 
Thr Gly Lys Leu lie Val Asn Ala 

180 



ttg caa gtg aaa aat gga gaa 
Leu Gin Val Lys Asn Gly Glu 
195 



ctg 
Leu 
200 



ttt atg aag ctg gta ttt cag : gag 
Phe Met Lys Leu Val Phe Gin Glu 
210 215 



tgt 
Cys 
185 

ctg 
Leu 



gca 
Ala 



att aat 
lie Asn 



aca acg 
Thr Thr 



tgc cgc 
Cys Arg 



cct , tta 
Pro Leu 



cca get 
Pro Ala 
205 

att tta 
lie Leu 
220 



act gcg tta 576 
Thr Ala Leu 
190 . , 

• t 
tat ctg get 624 
Tyr Leu Ala 



aaa ctt gaa 672 
Lys Leu Glu 



aat gaa gaa aag get tgg gag egg gtt cag gec gtt tgt ggg caa acg 
Asn Glu Glu Lys Ala Trp Glu Arg Val Gin Ala Val Cys Gly Gin Thr 
225 230 ! 235 240 



720 



aaa gag aat cgt tea tea atg ctg gtt gac gtc att gga ggc egg cag 
Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val lie Gly Gly Arg Gin 

245 250 255 



768 



acg gaa get gac gec att ate gga tac tta ttg aag gaa gca agt ctt 
Thr Glu Ala Asp Ala lie lie Gly Tyr Leu Leu Lys Glu Ala Ser Leu 

260 265 270 



816 



caa ggt ctt gat gee gtc cac eta gag ttt tta tat ggc age ate aaa 
Gin Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr Gly Ser lie Lys 
275 280 285 



864 



gca ttg gag cga aat aca aac aaa gtc ttt 
Ala Leu Glu Arg Asn Thr Asn Lys Val Phe 
290 295 



894 



<210> 30 
<211> 298 
<212> PRT 

<213> Bacillus subtilis 
<400> 30 

Met Lys lie Gly lie Tie Gly Gly Gly Ser Val Gly Leu Leu Cys Ala 
15 10 15 

Tyr Tyr Leu Ser Leu Tyr His Asp Val Thr Val Val Thr Arg Arg Gin 

20 25 30 



Glu Gin Ala Ala Ala lie Gin Ser Glu Gly He Arg Leu Tyr Lys Gly 
35 40 45 



Gly Glu Glu Phe Arg Ala Asp Cys Ser Ala Asp Thr Ser He Asn Ser 
50 55 60 



WO 01/21772 



PCT/US00/25993 



-34- . 

Asp Phe Asp Leu Leu Val Val Thr Val Lys Gin His. Gin Leu Gin, Ser t ■ 

'65 70 ' 75 ' '. ' ' 80 » ' 

i 

' ( - ' 

Val Phe Ser Ser Leu Glu Arg lie Gly Lys Thr Asn lie Leu Phe 'Leu 

85 90 95 

Gin Asn Gly Met Gly His He His Asp Leu ,Lys Asp Trp His Val Gly 

100 1 105 , 1 110 ' 



,His Ser He Tyr Val Gly He 
115 < 



Val Glu His Gly Ala Val ( Arg Lys 'Ser, 
120 125 ' 



Asp Thr Ala Val Asp His Thr! Gly Leu Gly Ala He Lys Trp Ser Ala 
130 135; 140 



I 



i * • 



Phe Asp Asp Ala Glu Pro Asp : Arg Leu Asn He Leu Phe Gin His Asn 
145 150 155 160 

His Ser Asp Phe Pro He Tyr Tyr, Glu Thr Asp Trp Tyr Arg Leu Leu 

165 ' 170 175 

i 

Thr Gly Lys Leu He Val AsnlAla Cys He Asn Pro Leu Thr Ala Leu 

180 ! 185 190 i 

Leu Gin Val Lys Asn Gly Glu Leu Leu Thr Thr Pro Ala Tyr Leu Ala 
195 200 205 

Phe Met Lys Leu Val Phe Gin Glu Ala Cys Arg He Leu Lys Leu Glu 
210 215 . 220 

Asn Glu Glu Lys Ala Trp Glu Arg Val Gin Ala Val Cys Gly Gin Thr 
225 • 230 235 240 

Lys Glu Asn Arg Ser Ser Met Leu Val Asp Val lie Gly Gly Arg Gin 

245 250 255 

Thr Glu Ala Asp Ala He He Gly Tyr Leu Leu Lys Glu Ala Ser Leu 

260 265 270 

Gin Gly Leu Asp Ala Val His Leu Glu Phe Leu Tyr Gly Ser He Lys 
275 280 285 



Ala Leu Glu Arg Asn Thr Asn Lys Val Phe 
290 295 



<210> 31 
<211> 1725 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (1722) 

<400> 31 

atg ggg act aat gta cag gtg gat tea gca tct gec gaa tgt aca cag 48 
Met Gly Thr Asn Val Gin Val Asp Ser Ala Ser Ala Glu Cys Thr Gin 
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i 

1 5 10 1 15 

• i , . . • .i .... 

acg atg age gga 1 gca tta atg ctg att gaa tea tta aaa aaa gag aaa 96 

Thr Met Ser* Gly Ala Leu Met Leu lie Glu Ser Leu ,Ly's Lys'Glu Lys 

20 i 25 30 

gta gaa atg ate ttc ggt tat ccg ggc ggg get gtg ctt ccg att tac 144 

Val Glu Met lie Phe Gly Tyr Pro Gly Gly' Ala Val Leu Pro lie Tyr 

35 ' 40 : " 45 ' , 

i * ( 

gat aag eta tac aat tea ggg ttg i gta cat ate ctt ccc cgt cac gaa 192 

Asp Lys* Leu Tyr Asn Ser Gly Leu Va'l His lie Leu Pro Arg His Glu 

50 , 55 60 

caa gga gca att cat gca gcg gag gga tac gca agg gtc tec gga aaa 240 

Gin Gly Ala lie His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys 

65 70 ' 75 . 80 

ccg ggt gtc gtc att gec acg tea ggg ccg gga gcg aca aac ctt gtt 288 

Pro Gly Val Val lie Ala Thr Ser Gly Pro Gly Ala 1 Thr Asn Leu Val 

85 90 95 

, r 

i I 

aca ggc ctt get gat gee atg att gat tea ttg ccg tta gtc gtc ttt 336 

Thr Gly Leu Ala Asp Ala Met lie Asp Ser Leu Pro Leu Val Val Phe 

100 105 110 

aca ggg cag gta gca ace, tct gta ate ggg age gat gca ttt cag gaa 384 

Thr Gly Gin Val Ala Thr Ser Val lie Gly Ser Asp Ala Phe Gin Glu 

115 120 125 

gca gac att tta ggg att acg atg cca gta aca aaa cac age tac cag 432 

Ala Asp lie Leu Gly lie Thr Met Pro Val Thr Lys His Ser Tyr Gin 

130 135 140 

gtt cgc cag ccg gaa gat ctg ccg cgc ate att aaa gaa gcg ttc cat 480 

Val Arg Gin Pro Glu Asp Leu Pro Arg lie lie Lys Glu Ala Phe His 

145 150 155 160 

att gca aca act gga aga ccc gga cct gta ttg att gat att ccg aaa 528 

lie Ala Thr Thr Gly Arg Pro Gly Pro Val Leu lie Asp He Pro Lys 

165 170 175 

gat gta gca aca att gaa gga gaa ttc age tac gat cat gag atg aat 576 

Asp Val Ala Thr He Glu Gly Glu Phe Ser Tyr Asp His Glu Met Asn 

180 185 190 

etc ccg gga tac cag ccg aca aca gag ccg aat tat ttg cag ate cgc 624 

Leu Pro Gly Tyr Gin Pro Thr Thr Glu Pro Asn Tyr Leu Gin He Arg 

195 200 205 

aag ctt gtg gaa gee gtg age agt gcg aaa aaa ccg gtg ate ctg gcg 672 

Lys Leu Val Glu Ala Val Ser Ser Ala Lys Lys Pro Val He Leu Ala 

210 215 220 

ggt gcg ggc gta ctg cac gga aaa gcg tea gaa gaa tta aaa aat tat 720 

Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr 

225 230 235 240 



0 
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get gaa cag Gag caa ate cct gtg gca cac acc .,ctt ttg. ggg etc gga 768 , 
Ala Glu Gin Gin, 6ln lie. Pro Val Aia His Thr Leu Leu Gly Leu Gly 

245 ' 250 ' ' ,255 

ggc ttc ccg get 'gac cat ccg ctt ttc eta ggg atg gcg gga atg cac 816 

Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Met Ala Gly Met His 

260 ■ , 265 , . ' 270 

i 1 1 

ggt act tat aca gee aat atg gee ctt cat gaa tgt gat eta tta ate 864 

Gly Thr Tyr Thr Ala Asn Met Ala Leu 1 His Glu Cys Asp 'Leu Leu lie 

, ,275 , 280 1 , ' 285 

t i 

agt ate ggc gec cgt ttt gat gac cgt gtc aca gga aac ctg aaa cac 912 
Ser lie Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His , 
290 295 ' 300 

ttt gec aga aac gca aag ata gee cac ate gat att gat cca get gaa 960 
Phe Ala Arg Asn Ala Lys lie Ala His lie Asp lie Asp Pro Ala Glu 
305 . , 310 315 f 320 

ate gga aaa ate atg aaa aca cag att cct gta gtc gga gac age aaa 1008 
lie Gly Lys lie Met Lys Thr Gin lie Pro Val Val Gly Asp' Ser Lys 

■ 325 330 335 

att gtc ctg cag gag ctg ate aaa caa gac ggc aaa caa age gat tea ' 1056 
lie Val Leu Gin Glu Leu lie Lys Gin Asp Gly Lys Gin Ser Asp Ser 

340 345 350 

age gaa tgg aaa aaa cag etc gca gaa tgg aaa gaa gag tat ccg etc 1104 
Ser Glu Trp Lys Lys Gin Leu Ala Glu Trp Lys Glu Glu Tyr Pro Leu 
355 360 365 

tgg tat gta gat aat gaa gaa gaa ggt ttt aaa cct cag aaa ttg att 1152 
Trp Tyr Val Asp Asn Glu Glu Glu Gly Phe Lys Pro Gin Lys Leu lie 
370 375 380 

gaa tat att cat caa ttt aca aaa gga gag gec att gtc gca acg gat 1200 
Glu Tyr lie His Gin Phe Thr Lys Gly Glu Ala lie Val Ala Thr Asp 
385 390 395 400 

gta ggc cag cat caa atg tgg tea gcg caa ttt tat ccg ttc caa aaa 1248 
Val Gly Gin His Gin Met Trp Ser Ala Gin Phe Tyr Pro Phe Gin Lys 

405 410 415 

gca gat aaa tgg gtc acg tea ggc gga ctt gga acg atg gga ttc ggt 1296 
Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Met Gly Phe Gly 

420 425 430 

ctt ccg gcg gcg ate ggc gca cag ctg gec gaa aaa gat get act gtt 1344 
Leu Pro Ala Ala lie Gly Ala Gin Leu Ala Glu Lys Asp Ala Thr Val 
435 440 445 

gtc gcg gtt gtc gga gac ggc gga ttc caa atg acg ctt caa gaa etc 1392 
Val Ala Val Val Gly Asp Gly Gly Phe Gin Met Thr Leu Gin Glu Leu 
450 455 460 

gat gtt att cgc gaa tta aat ctt ccg gtc aag gta gtg att tta aat 1440 
Asp Val lie Arg Glu Leu Asn Leu Pro Val Lys Val Val lie Leu Asn 
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i 

465 470 475 480, 

aac get tgt etc gga atg gtc aga cag tgg cag gaa 1 att ttc tat , gaa 1488 
Asn Ala Cys Leu Gly Met Val Arg ,Gln Trp 'Gin 1 Glii lie Phe Tyr ,Glu 

485 490 . ' 495 

gaa cgt tat tea gaa tct aaa ttc get tct cag cct gac ttc gtc. aaa 1536 
Glu Arg Tyr Ser Glu Ser Lys ( Phe Ala Ser Gin Pro, Asp Phe Val Lys 

500 505 ■ ' , 510 

ttg tec gaa gca tac ggcatt aaa ggc ate aga att tea' tea gaa gcg 1584 
Leu Ser Glu Ala Tyr Gly lie' Lys Gly lie Arg lie Ser Ser Glu Ala 
i 515 j 520 525 

gaa gca aag gaa aag ctg gaa : gag gca tta aca tea aga gaa cct'gtt 1632 
Glu Ala Lys Glu Lys Leu Glu' Glu Ala Leu Thr Ser Arg Glu Pro Val 
530 535^ 540 

gtc att gac gtg egg gtt gee age gaa gaa aaa gta ttc ccg atg gtg 1680 
Val lie Asp Val Arg Val Ala: Ser Glu Glu Lys Val Phe Pro Met Val 

545 550 555 560 

i 

! , 

get ccg ggg aaa ggg ctg cat; gaa atg gtg ggg gtg aaa cct tga 1725 
Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro 

565 570 



<210> 32 
<211> 574 
<212> PRT 

<213> Bacillus subtilis 
<400> 32 

Met Gly Thr Asn Val Gin Val Asp Ser Ala Ser Ala Glu Cys Thr Gin 
1 5 10 15 

Thr Met Ser Gly Ala Leu Met Leu lie Glu Ser Leu Lys Lys Glu Lys 

20 25 30 

Val Glu Met lie Phe Gly Tyr Pro Gly Gly Ala Val Leu Pro lie Tyr 
35 40 45 

Asp Lys Leu Tyr Asn Ser Gly Leu Val His lie Leu Pro Arg His Glu 
50 55 60 

Gin Gly Ala lie His Ala Ala Glu Gly Tyr Ala Arg Val Ser Gly Lys 
65 70 75 80 

Pro Gly Val Val lie Ala Thr Ser Gly Pro Gly Ala Thr Asn Leu Val 

85 90 95 

Thr Gly Leu Ala Asp Ala Met lie Asp Ser Leu Pro Leu Val Val Phe 

100 105 110 

Thr Gly Gin Val Ala Thr Ser Val lie Gly Ser Asp Ala Phe Gin Glu 
115 120 125 

Ala Asp lie Leu Gly lie Thr Met Pro Val Thr Lys His Ser Tyr Gin 
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130 



135 



140, 



i i 
i 



Val Arg Gin Pro Glu Asp Leu Pro Arg lie ,Ile Lys Glu Ala Phe 'His 
145 150. 155 ' . ■ 160 

i 

i ■ 

lie Ala Thr Thr Gly Arg Pro. Gly Pro Val Leu He Asp He Pro Lys 

165 170 , 175 



Asp Val Ala Thr He Glu Gly 

180 



Glu Phe Ser Tyr'Asp His Glu Met Asn 



185 



( 190 



Leu Pro Gly Tyr Gin Pro Thr Tnr Glu Pro Asn Tyr Leu Gin He Arg 



195 



200 



205 



I 



Lys Leu Val Glu Ala Val Ser; Ser Ala Lys Lys Pro Val lie Leu Ala 
210 215 : 220 



Gly Ala Gly Val Leu His Gly Lys Ala Ser Glu Glu Leu Lys Asn Tyr 
225 230 235 240 



Ala Glu Gin Gin Gin He Pre Val Ala His Thr Leu Leu Gly Leu Gly 



245 



250 



255 



Gly Phe Pro Ala Asp His Pro Leu Phe Leu Gly Met Ala Gly Met His 

260 265 270 

Gly Thr Tyr Thr Ala Asn Met Ala Leu His Glu Cys Asp Leu Leu He 
275 280 285 

Ser He Gly Ala Arg Phe Asp Asp Arg Val Thr Gly Asn Leu Lys His 
290 295 300 

Phe. Ala Arg Asn Ala Lys He Ala His lie Asp He Asp Pro Ala Glu 
305 310. 315 320 

He Gly Lys He Met Lys Thr Gin He Pro Val Val Gly Asp Ser Lys 

325 330 335 

He Val Leu Gin Glu Leu lie Lys Gin Asp Gly Lys Gin Ser Asp Ser 

340 345 350 

Ser Glu Trp Lys Lys Gin Leu Ala Glu Trp Lys Glu Glu Tyr Pro Leu 
355 360 365 

Trp Tyr Val Asp Asn Glu Glu Glu Gly Phe Lys Pro Gin Lys Leu He 
370 9 375 380 

Glu Tyr lie His Gin Phe Thr Lys Gly Glu Ala He Val Ala Thr Asp 
385 390 395 400 

Val Gly Gin His Gin Met Trp Ser Ala Gin Phe Tyr Pro Phe Gin Lys 

405 410 415 

Ala Asp Lys Trp Val Thr Ser Gly Gly Leu Gly Thr Met Gly Phe Gly 

420 425 430 

Leu Pro Ala Ala lie Gly Ala Gin Leu Ala Glu Lys Asp Ala Thr Val 
435 440 445 
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Val Ala Val Val Gly Asp Gly Gly 'Phe felii Met Thr Leu Gln i Glu Leu 

450 1 '455 1 i 460 ' ' 

t 

Asp Val lie Arg iGlu Leu Asn Leu Pro Val Lys Val Val lie Leu Asn 
465 470 475 480 

Asn Ala Cys Leu Gly Met Val Arg Gin Trp 'Glri Glu lie Phe T,yr Glu 

485 ' ' 490 " ' ( 495 

Glu Arg Tyr Ser Glu Ser Lys Phe lAla Ser Gin Pro Asp Pne Val Lys 
' J 500 ■ ' 505 , 510.. 

i 

Leu Ser Glu Ala Tyr Gly lie Lys Gly lie Arg lie Ser Ser Glu Ala 
515 520 - ( 525 

Glu Ala Lys Glu Lys Leu Glu Glu Ala Leu Thr Ser Arg Glu Pro Val 
530 . 535 , 540' 

Val He 1 Asp Val Arg Val Ala Ser Glu Glu Lys Val ! Phe Pro Met Val 
545 550 555 560 

i j j 

Ala Pro Gly Lys Gly Leu His Glu Met Val Gly Val Lys Pro 

565 570 



<210> 


33 


<211> 


525 


<212> 


DNA 


<213> 


Bacillus subtilis 


<220> 




<221> 


CDS 


<222> 


(1) . - (522) 


<400> 


33 


ttg aaa aga att ate aca 


Met Lys Arg lie lie Thr 


1 


5 



10 15 



48 



aac egg ate ace ggt eta ttc aca aaa agg cat tac aac att gaa age 96 
Asn Arg lie Thr Gly Leu Phe Thr Lys Arg His Tyr Asn lie Glu Ser 

20 25 30 

att aca gtt gga cac aca gaa aca gee ggc gtt tec aga ate acc ttc 144 
lie Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg lie Thr Phe 
35 40 45 

gtc gtt cat gtt gaa ggt gaa aat gat gtt gaa cag tta acg aaa cag 192 
Val Val His Val Glu Gly Glu Asn Asp Val Glu Gin Leu Thr Lys Gin 
50 55 60 

etc aac aaa cag att gat gtg ctg aaa gtc aca gac ate aca aat caa 240 
Leu Asn Lys Gin lie Asp Val Leu Lys Val Thr Asp lie Thr Asn Gin 
65 70 75 80 

teg att gtc cag agg gag ctg gec tta ate aag gtt gtc tec gca cct 288 
Ser lie Val Gin Arg Glu Leu Ala Leu lie Lys Val Val Ser Ala Pro 



4 
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85 



90 



95 



tea aca aga aca gag att aat gga ate ata gaa ccg' ttt aga gectet 
Ser Thr Arg Thr Glu lie Asn Gly lie lie Glu' Prp Phe Arg Ala iSe'r 

100 105 110 . 



336 



gtc gtt gat gtc age aga gac age ate gtt gtt cag gtg aca 
Val Val Asp Val Ser Arg Asp. Ser lie Val Val Gin' Val Thr 
115 120 125 . 



tct aac aaa att gaa gcg ctt 
Ser Asn Lys lie Glu Ala Leu 
' 130 135 



att gag tta tta aaa cct tat 
lie Glu Leu Leu Lys Pro Tyr 

140 



ggt gaa 384 
Gly Glu 



ggc att 4 32 
Gly lie 



aaa gaa ate gcg aga aca ggt; aca acg get ttt gcg agg gga 
Lys Glu lie Ala Arg Thr Gly ! Thr Thr Ala Phe Ala Arg Gly 
145 150 155 



j.' 



ace age 
Thr Ser 
160 



480 



aaa agg cgt cat cca ata aaa caa tat eta ttg tat aaa aca taa 
Lys Arg Arg His Pro lie' Lys Gin Tyr. Leu Leu Tyr Lys Thr 

165 170 



525 



<210> 34 
<211> 174 
<212> PRT 

<213> Bacillus subtilis 
<400> 34 

Met Lys Arg lie lie Thr Leu Thr Val Val Asn Arg Ser Gly Val Leu 
15 10 15 

Asn Arg lie Thr Gly Leu Phe Thr Lys Arg His Tyr Asn lie Glu Ser 

20 25 30 

lie Thr Val Gly His Thr Glu Thr Ala Gly Val Ser Arg He Thr Phe 
35 40 45 

Val Val His Val Glu Gly Glu Asn Asp Val Glu Gin Leu Thr Lys' Gin 
50 55 60 

Leu Asn Lys Gin He Asp Val Leu Lys Val Thr Asp He Thr Asn Gin 
65 70 75 80 

Ser He Val Gin Arg Glu Leu Ala Leu He Lys Val Val Ser Ala Pro 

85 90 95 

Ser Thr Arg Thr Glu He Asn Gly He He Glu Pro Phe Arg Ala Ser 

100 105 110 

Val Val Asp Val Ser Arg Asp Ser He Val Val Gin Val Thr Gly Glu 
115 120 125 

Ser Asn Lys He Glu Ala Leu He Glu Leu Leu Lys Pro Tyr Gly He 
130 135 140 



Lys Glu He Ala Arg Thr Gly Thr Thr Ala Phe Ala Arg Gly Thr Ser 
145 150 155 160 
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Lys Arg Arg His Pro lie Lys Gin Tyr Leu Leu Tyr Lys' Thr 

' 165. ' 1 ' 170 ' t 



<210> 35 
<211> 1029 

<212> DNA | , ! 

<213> Bacillus subtilis 



<220> 

<^2i> cbs 

<222> ' (1) . . (10,26) 



<400> 35 1 ( 

atg gta aaa gta tat tat aac ggt gat ate aaa gag aac gta ttg get 

Met Val Lys Val Tyr Tyr Asn Gly Asp lie Lys Glu Asn Val Leu Ala 
1 5 10 15 



48 



gga aaa 1 aca gta gcg gtt ate ggg tac ggt teg caa ggc cac gca cat 
Gly Lys Thr Val Ala Val He Gly Tyr Gly Ser Gin Gly His Ala His 

20 .25 30. 



96 



gee ctg aac ctt aaa gaa age gga gta gac gtg ate gtc ggt gtt aga 
Ala Leu Asn Leu Lys Glu Ser Gly Val Asp Val He Val Gly Val Arg 
35 40 , 45 



144 



caa gga aaa tct ttc act caa gec caa gaa gac gga cat aaa gta ttt 
Gin Gly Lys Ser Phe Thr' Gin Ala Gin Glu Asp Gly His Lys Val Phe 
50 55 60 



192 



tea gta aaa gaa gcg gca gec caa gec gaa ate ate atg gtt ctg ctt 
Ser Val Lys Glu Ala Ala Ala Gin Ala Glu He He Met Val Leu Leu 
65 70 75 80 



240 



ccg gat gag cag cag caa aaa gta tac gaa get gaa ate aaa gat gaa 
Pro Asp Glu Gin Gin Gin Lys Val Tyr Glu Ala Glu He Lys Asp Glu 

85 90 95 



288 



ttg aca gca gga aaa tea tta gta ttc get cat gga ttt aac gtg cat 
Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His 

100 105 110 



336 



ttc cat caa att gtt cct ccg gcg gat gta gat gta ttc tta gtg gec 
Phe His Gin He Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala 
115 120 125 



384 



cct aaa ggc ccg gga cac ttg gta aga aga aca tat gag caa gga get 
Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr. Tyr Glu Gin Gly Ala 
130 135 140 



432 



ggc gta cct gca ttg ttc gca ate tat caa gat gtg act gga gaa gca 
Gly Val Pro Ala Leu Phe Ala He Tyr Gin Asp Val Thr Gly Glu Ala 
145 150 155 160 



480 



aga gac aaa gec etc get tat get aaa gga ate ggc ggc gca aga gcg 
Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly He Gly Gly Ala Arg Ala 

165 170 175 



528 
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ggc gta tta .gaa acg aca, ttt aaa gaa gaa aca gaa aca. gat ttg ttc 576 
Gly Val Leu Glu Thr Thr Phe Lys Glu Glu Thr Glu Thr Asp Leu Phe 

"180 « 185 190 ' ' ' 

* 

ggt gag caa gca gtt ctt tgc ggc gga tta age gcg ctt gtc aaa gec 624 
Gly Glu Gin Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala 

195 , ' 200 ■ " ' 205 

i' . ■ ■ t 

gga ttt gaa acc tta act gaa gca ggt* tat cag cct gaa 'ctt gca tac 672 
G^y Phe, Glu Thr Leu Thr Glu Ala 'Gly Tyr Gin Pro Glu Leu Ala Tyr 
210 215 1 . 220 

ttc gag tgt ctt cat gag ctg aaa tta ate gta gac ctt atg tac gaa , 720 

Phe Glu Cys Leu His Glu Leu Lys Leu lie Val Asp Leu Met Tyr Glu 

225 230 . , , 235 240 

■ 

gaa gga ctt gca gga atg aga tat tea ate tct gac aca gca cag tgg 768 

Glu Gly Leu Ala Gly Met Arg Tyr Ser lie Ser Asp, Thr Ala Gin Trp 

245 250 255 

gga gat ttc gta tea ggc cct cgcgtt gtg gac gec aaa gta' aaa gaa 816 
Gly Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu 

260 265 270 

tct atg aaa gaa gta tta aaa gat ate caa aac ggt aca ttc gca aaa 864 
Ser Met Lys Glu Val Leu Lys Asp lie Gin Asn Gly Thr Phe Ala Lys 
275 280 2.85 

gag tgg ate gtc gaa aac caa gta aac cgt cct cgt ttc aac get ate 912 
Glu Trp lie Val Glu Asn Gin Val Asn Arg Pro Arg Phe Asn Ala lie 
290 295 300 

aat gca age gag aac gaa cat caa ate gaa gta gtg gga aga aag ctt 960 
Asn Ala Ser Glu Asn Glu His Gin lie Glu Val Val Gly Arg Lys Leu 
305 310 315 320 

cgt gaa atg atg ccg ttt gtg aaa caa ggc aag aag aag gaa gcg gtg 1008 
Arg Glu Met Met Pro Phe Val Lys Gin Gly Lys Lys Lys Glu Ala Val 

325 330 335 

gtc tec gtt gcg caa aat taa 1029 
Val Ser Val Ala Gin Asn 

340 



<210> 36 
<211> 342 
<212> PRT 
<213> Bacillus 

<400> 36 
Met Val Lys Val 
1 

Gly Lys Thr Val 

20 



subtilis 

Tyr Tyr Asn Gly 
5 

Ala Val lie Gly 



Asp lie Lys Glu 
10 

Tyr Gly Ser Gin 
25 



Asn Val Leu Ala 
15 

Gly His Ala His 
30 
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Ala Leu Asn Leu Lys Glu Ser Gly Val. Asp Val lie Val Gly Val Arg 

- ' 35 40 , ' : 45 ' ' , 1 ( 

r i 1 ■ 

i 

Gin Gly Lys Ser Phe Thr Gin Ala Gin Glu Asp Gly His Lys Val ,Phe 
50 55 1 .60 

Ser Val Lys Glu Ala Ala Ala Gin Ala Glu lie lie Met Val Leu. Leu 
65 70 75 , ■ , 80 

Pro Asp Glu Gin Gin Gin Lys Val Tyr Glu Ala Glu lie Lys Asp G^u 

85 I 90 1 95 ' 

I 

, Leu Thr Ala Gly Lys Ser Leu Val Phe Ala His Gly Phe Asn Val His 

100 105 110 

Phe His Gin lie Val Pro Pro Ala Asp Val Asp Val Phe Leu Val Ala 
115 120 125 

Pro Lys Gly Pro Gly His Leu Val Arg Arg Thr Tyr Glu Gin Gly Ala 
130 135 140 

Gly Val Pro Ala Leu Phe Ala lie Tyr Gin Asp Val Thr Gly Glu Ala 
145 150 ! ' 155 160 

Arg Asp Lys Ala Leu Ala Tyr Ala Lys Gly lie Gly Gly Ala Arg Ala 

165 170 175 

Gly Val Leu Glu Thr Thr Phe Lys Glu Glu Thr Glu Thr Asp Leu Phe 

180 185 190 

Gly Glu Gin Ala Val Leu Cys Gly Gly Leu Ser Ala Leu Val Lys Ala 
195 200 205 

Gly Phe Glu Thr Leu Thr Glu Ala Gly Tyr Gin Pro Glu Leu Ala Tyr 
210 215 220 

Phe Glu Cys Leu His Glu Leu Lys Leu lie Val Asp Leu Met Tyr Glu 
225 230 235 240 

Glu Gly Leu Ala Gly Met Arg Tyr Ser lie Ser Asp Thr Ala Gin Trp 

245 250 255 

Gly Asp Phe Val Ser Gly Pro Arg Val Val Asp Ala Lys Val Lys Glu 

260 265 270 

Ser Met Lys Glu Val Leu Lys Asp He Gin Asn Gly Thr Phe Ala Lys 
275 280 285 

Glu Trp He Val Glu Asn Gin Val Asn Arg Pro Arg Phe Asn Ala He 
290 295 3.00 

Asn Ala Ser Glu Asn Glu His Gin He Glu Val Val Gly Arg Lys Leu 
305 310 315 320 

Arg Glu Met Met Pro Phe Val Lys Gin Gly Lys Lys Lys Glu Ala Val 

325 330 335 



Val Ser Val Ala Gin Asn 
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340 



<210> 37 
<211> 1674 
<212> DNA 

<213> Bacillus subtilis 



<220> 

<221> CDS 

<222> (1) . V (1674) 



• <400> 37 

atg gca gaa tta cgc agt aat 
Met Ala Glu Leu Arg Ser Asn 



atg ate aca caa gga ate gat aga get 
Met lie Thr Gin Gly lie Asp Arg A'la 
10 15 



48 



ccg 
Pro 



cac 
His 



cgc 
Arg 



agt 
Ser 
20 



ttg 
Leu 



ctt 
Leu 



cgt 
Arg 



gca 
Ala 



gca 
Ala 
25 



ggg 

Gly 



gta 
Val 



aaa 
Lys 



gaa 
Glu 



gag 
Glu 
30 



gat ttc 
Asp Phe 



96 



ggc 
Gly 



aag 
Lys 



ccg 
Pro 
35 



ttt 
Phe 



att 
He 



gcg 
Ala 



gtg! 
Val - 



tgt 
Cys 
40 



aat 
Asn 



tea 
Ser 



tac 
Tyr 



att 
He 



gat 
Asp 
45 



ate 
He 



gtt 
Val 



ccc 
Prp 



144 



ggt 
Gly 



cat 
His 
50 



gtt 
Val 



cac 
His 



ttg 
Leu 



cag 
Gin 



gag 
Glu 
55 



ttt ggg aaa ate 
Phe Gly Lys He 



gta 
Val 
60 



aaa 
Lys 



gaa 
Glu 



gca 
Ala 



ate 
He 



192 



aga gaa gca ggg ggc gtt ccg ttt gaa ttt aat ace att ggg 
Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr He Gly 
65 70 75 



gta gat 
Val Asp 
80 



240 



gat ggc ate gca atg ggg cat ate ggt atg aga tat teg ctg cca age 
Asp Gly He Ala Met Gly His He Gly Met Arg Tyr Ser Leu Pro Ser 

85 90 95 



288 



cgt gaa att ate gca gac tct gtg gaa acg gtt gta tec gca cac tgg 
Arg Glu He He Ala Asp Ser Val Glu Thr Val Val Ser Ala His Trp 

100 105 110 



336 



ttt gac gga atg gtc tgt att ccg aac tgc gac aaa ate aca ccg gga 
Phe Asp Gly Met Val Cys He Pro Asn Cys Asp Lys lie Thr Pro Gly 
115 120 125 



384 



atg ctt atg 
Met Leu Met 
130 



gcg gca 
Ala Ala 



atg cgc ate aac att ccg acg att ttt gtc age 
Met Arg He Asn He Pro Thr He Phe Val Ser 
135 140 



432 



ggc gga ccg atg gcg gca gga aga aca agt tac ggg cga aaa ate tec 
Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr Gly Arg Lys He Ser 
145 150 155 160 



480 



ctt tec tea gta ttc gaa ggg gta ggc gee tac caa gca ggg aaa ate 
Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr Gin Ala Gly Lys He 

165 170 175 



528 



aac gaa aac gag ctt caa gaa eta gag cag ttc gga tgc cca acg tgc 



576 
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Asn Glu Asn Glu Leu Gin Glu Leu Glu Gin Phe Gly Cys 1 Pro Thr Cys 

180 , ■ 165 ' ' 190 

ggg tct tgc tea ggc atg ttt acg gcg aac tea atg aac tgt 'ctg tea 624 

Gly Ser Cys Ser ( Gly Met Phe Thr Ala Asn' Ser Met Asn Cys Leu Ser 

195 200 205 

gaa gca ctt ggt ctt get ttg ccg ggt aat; gga acc att ctg gca aca 672 

Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr lie Leu Ala Thr 

210 215 ' ' 220 , ' 

tct ccg 1 gaa cgc aaa gag ttt g'tg aga aaa teg get gcg caa, tta atg 720 

Ser Pro Glu 'Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gin Leu Met 

225 230 , 235 ; 240 

t 

gaa acg att cgc aaa gat ate aaa ccg cgt ' gat att gtt aca gta aaa 768 

Glu Thr lie Arg Lys Asp He Lys. Pro Arg Asp He Val Thr Val Lys 

245 25(0 ' 255 

gcg att' gat aac, gcg ttt gca etc gat atg gcg etc 'gga ggt tct aca 816 

Ala He Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr 

260 , 265 270, 

aat acc gtt ctit cat acc ctt gec ctt gca aac gaa gee ggc gtt gaa 864 

Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu 

275 280 285 

tac tct tta gaa cgc att aac gaa gtc get gag cgc gtg ccg cac ttg 912 

Tyr Ser Leu Glu Arg He Asn Glu Val Ala Glu Arg Val Pro His Leu 

290 295 300 

get aag ctg gcg cct gca teg gat gtg ttt att gaa gat ctt cac gaa 960 

Ala Lys Leu Ala Pro Ala Ser Asp Val Phe He Glu Asp Leu His Glu 
305 310 315 320 

gcg ggc ggc gtt tea gcg get ctg aat gag ctt teg aag aaa gaa gga 1008 

Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly 

325 330 335 

gcg ctt cat tta gat gcg ctg act gtt aca gga aaa act ctt gga gaa 1056 

Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu 

340 345 350 

acc att gec gga cat gaa gta aag gat tat gac gtc att cac ccg ctg 1104 

Thr He Ala Gly His Glu Val Lys Asp Tyr Asp Val He His Pro Leu 

355 360 365 

gat caa cca ttc act gaa aag gga ggc ctt get gtt tta ttc ggt aat 1152 

Asp Gin Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn 

370 375 380 

eta get ccg gac ggc get ate att aaa aca ggc ggc gta cag aat ggg 1200 

Leu Ala Pro Asp Gly Ala He He Lys Thr Gly Gly Val Gin Asn Gly 
385 390 395 400 

att aca aga cac gaa ggg ccg get gtc gta ttc gat tct cag gac gag 1248 

He Thr Arg His Glu Gly Pro Ala Val Val Phe Asp Ser Gin Asp Glu 

405 410* 415 
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gcg ctt gac ggc att ate, aac cga aaa gta aaa gaa ggp gac gtt gtc 1296 
Ala Leu Asp Gly' lie lie Asn Arg Lys Val L^s Glu Gly Asp ( Val Val 

"420 ' 425 • 4.30 

i ■ 

ate ate aga tac gaa ggg cca aaa ggc gga cct ggc atg ccg gaa atg 1344 
lie lie Arg Tyr Glu Gly Pro Lys Gly Gly Pro Gly Met Pro Glu Met 
435 , ' 4 40 . '* 445 

ctg gcg cca aca tec caa ate gtt gga< atg gga etc ggg 'cca aaa gtg 1392 
Leu Ala ( E?ro Thr Ser Gin He Val 1 Gly Met Gly Leu Gly Pro Lys Val 
450 1 455 1 460 

i 

gca ttg att acg gac gga cgt ttt tec gga gec tec cgt ggc etc tea t 1440 
Ala Leu He Thr Asp Gly Arg Phe Ser £ly,Ala Ser Arg Gly Leu Ser 
465 470 , . 475 480 

i 1 

i 1 * 

I 

ate ggc cac gta tea cct gag gee get gafg ggc ggg ccg ctt gee ttt 1488 

He Gly His Val Ser Pro Glu Ala Ala Glu Gly Gly ( Pro Leu Ala Phe 

1 485 490 495 

gtt gaa aac gga gac cat att ate gtt gat att gaa aaa cgc' ate ttg 1536 
Val Glu Asn Gly Asp His He He Val Asp He Glu Lys Arg He Leu 

500 505 510 

gat gta caa gtg cca gaa gaa gag tgg gaa aaa cga aaa gcg aac tgg * 1584 
Asp Val Gin- Val Pro Glu Glu Glu Trp Glu Lys Arg Lys Ala Asn Trp 
515 520 525 

aaa ggt ttt gaa ccg aaa gtg aaa ace ggc tac ctg gca cgt tat tct 1632 
Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser 
530 535 540 

aaa ctt gtg aca agt gec aac acc ggc ggt att atg aaa ate 1674 
Lys Leu Val Thr Ser Ala Asn Thr Gly Gly He Met Lys He 
545 550 555 



<210> 38 
<211> 558 
<212> PRT 

<213> Bacillus subtilis 
<400> 38 

Met Ala Glu Leu Arg Ser Asn Met He Thr Gin Gly He Asp Arg Ala 
15 10 15 

Pro His Arg Ser Leu Leu Arg Ala Ala Gly Val Lys Glu Glu Asp Phe 

20 25 30 

Gly Lys Pro Phe He Ala Val Cys Asn Ser Tyr He Asp He Val Pro 
35 40 45 

Gly His Val His Leu Gin Glu Phe Gly Lys He Val Lys Glu Ala He 
50 55 60 

Arg Glu Ala Gly Gly Val Pro Phe Glu Phe Asn Thr He Gly Val Asp 
65 70 75 80 
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Asp Gly lie Ala Met Gly His lie Gly Met • Arg Tyr Ser Leu Pro' Ser 

85 90 ■ ' 95 , ' 

i , i 

' , • ' . ■ ' 

Arg Glu lie lie Ala Asp Ser Val Glu Thr Val Val Ser Ala Hxs Trp 

100 105 110 



Phe Asp Gly Met Val Cys lie, Pro Asn Cys Asp Lys ( lie Thr Pro Gly 
115 120 ' , 125 



Met Leu Met Ala Ala Met Arg lie Asn lie. Pro Thr lie 1 Phe Val Sef 
130 ' 135 140 



Gly Gly Pro Met Ala Ala Gly Arg Thr Ser Tyr Gly Arg Lys lie Ser 



145 



150 



155 



160 



Leu Ser Ser Val Phe Glu Gly Val Gly Ala Tyr Gin Ala Gly Lys lie 

165 170 ' 175 



Asn Glu Asn Glu Leu Gin • Glu Leu Glu Gin Phe Gly Cys Pro Thr Cys 

180 185 190 



Gly Ser Cys Ser Gly Met Phe' Thr Ala Asn Ser Met Asn Cys Leu Ser 
195 200 205 



Glu Ala Leu Gly Leu Ala Leu Pro Gly Asn Gly Thr lie Leu Ala Thr 
210 215 220 



S 



er Pro Glu Arg Lys Glu Phe Val Arg Lys Ser Ala Ala Gin Leu Met 
225 230 235 ' 240 



Glu Thr lie Arg Lys Asp He Lys Pro Arg Asp He Val Thr Val Lys 

245 250 255 

Ala He Asp Asn Ala Phe Ala Leu Asp Met Ala Leu Gly Gly Ser Thr 

260 265 270 

Asn Thr Val Leu His Thr Leu Ala Leu Ala Asn Glu Ala Gly Val Glu 
275 280 285 

Tyr Ser Leu Glu Arg He Asn Glu Val Ala Glu Arg Val Pro His Leu 
290 295 300 

Ala Lys Leu Ala Pro Ala Ser Asp Val Phe He Glu Asp Leu His Glu 
305 310 315 320 

Ala Gly Gly Val Ser Ala Ala Leu Asn Glu Leu Ser Lys Lys Glu Gly 

325 330 335 

Ala Leu His Leu Asp Ala Leu Thr Val Thr Gly Lys Thr Leu Gly Glu 

340 345 350 



Thr He Ala Gly His Glu Val Lys Asp Tyr Asp Val He His Pro Leu 
355 360 365 



Asp Gin Pro Phe Thr Glu Lys Gly Gly Leu Ala Val Leu Phe Gly Asn 
370 375 380 
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Leu Ala Pro Asp Gly Ala lie lie Lys /Thr Gly Gly Val Gin Asn Gly 

385 . ' 1 390, ' ' 395 " .,'*". 400 

lie Thr Arg"HisGlu Gly* Pro Ala Val Val, Phe Asp 'Ser Gin Asp Glu 

•405 410 415 

Ala Leu Asp Gly lie lie Asn Arg Lys Val t Lys Glu Gly Asp Val Val 

420 , ' 425 ' ' 430 < 

lie lie Arg Tyr Glu Gly Pro Lys Gly- Gly Pro Gly Met Pro Glu Met 
,435 4,4 0 1 445 

Leu Ala Pro Th'r Ser Gin lie Val Gly Met Gly Leu Gly Pro Lys Val 

450 455 460 ' 

i 

Ala Leu lie Thr Asp Gly Arg Phe Ser Gly Ala Ser Arg Gly Leu Ser 

465 ' 470 475 480 

lie Gly His . Val Ser Pro Glu Ala Ala Glu Gly Gly, , Pro Leu Ala Phe 
1 485 490 495 

i 

Val Glu Asn Gly Asp His lie lie Val Asp lie Glu Lys Arg' lie Leu 

500 505 510 

Asp Val Gin Val Pro Glu Glu Glu Trp Glu Lys Arg Lys Ala Asn Trp 
515 520 525 

Lys Gly Phe Glu Pro Lys Val Lys Thr Gly Tyr Leu Ala Arg Tyr Ser 
530 535 540 

Lys Leu Val Thr Ser Ala Asn Thr Gly Gly lie Met Lys He 

545 550 555 



<210> 39 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35__signal 
<222> (136) . . (141) 

<220> 

<221> -10_signal 
<222> (159) . . (164) 

<400> 39 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 

tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120 

ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180 
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gcaaccccgc ctgt 194 

' '' ' 

<210> 40 , , ' ' ' ,' ■ < ' 

<211> 163 4 ' ' 

<212> DNA 

<213> Artificial Sequence 

<220> ' 
<223> Description of Artificial Sequence,: promoter 
j sequence 

<220> 

<221> -35_signal 
<222> (113) (118) 

<220> 

<221> -10_signal 
<222> (136) . . (141) 



<400> 40 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 

i ' 

i I 

tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 
atctacaagg tgtggtataa taatcttaac aacagcagga cgc 163 



<210> 41 
<211> 127 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : promoter 
sequence 



<220> 

<221> -35_signal 
<222> (34) . . (39) 

<220> 

<221> -10_signal 
<222> (58) . . (63) 

<220> 

<221> -35_signal 
<222> (75) . . (80) 



<220> 

<221> -10_signal 
<222> (98) . . (103) 



<400> 41 

gaggaatcat agaattttgt caaaataatt 
aatttaaatt ttatttgaca aaaatgggct 



ttattgacaa cgtcttatta acgttgatat 60 
cgtgttgtac aataaatgta gtgaggtgga 120 
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tqcaatq 127 

■ 1 , 'i 

<210> 42 , - ' 

<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

i <4 0 l 0> 4 2 

taaacatgag gaggagaaaa catg ' .24 

! I i' ' 

<210> 43 ! 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> , 

<223> Description of Artificial Sequence : ribosome 

binding site , 

<400> 43 

' attcgagaaa tggagagaat ataatatg 28_ 

<210> 44 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 44 

agaaaggagg tga 13 

<210> 45 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 45 

ttaagaaagg aggtgannnn atg 23 

<210> 46 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
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binding site 



•t .- 

i , 



<400> 46 ' i i 
ttagaaagga ggtgannnnn atg 1 23 



<210> 47 

<211> 23 1 ' 

<212> DNA . ( ' 

<213> Artificial Sequence i ., , ' 

<220> , 

<223> Description of Artificial Sequence : ribosome 
binding site , 

<400> 47 ' " • 

agaaaggagg tgannnnnnn atg i 23 

<210> 4i8 
<211> 22 

<212> DNA , 
<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

i 

<400> 48 

agaaaggagg tgannnnnna tg 22 

<210> 49 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 49 

ccctctagaa ggaggagaaa acatg 25 

<210> 50 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 50 

ccctctagag gaggagaaaa catg 24 

<210> 51 
<211> 23 
<212> DNA 
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i 



<213> Artificial Sequence 

1 I i 1 • ■.' . * . ■ 

I ' I 1 ' . 

<220> ' . 

<223> Description of Artificial Sequence: ribosome ■ ■• 

binding site 

<400> 51 : 

ttagaaagga gga-tt^aaat: atg ( ' ' ' > 23 

<210> 52 
<^11> 23, 

<212> DNA ' . 

<213> Artificial Sequence *' ( 

<220> ' 1 , 

<223> Description of Artificial Sequence : ribosome 
binding site ! 

<400> 52 ( 

ttagaaa'gga ggtttaatta atg 23 

<210> 53 * 
<211> 23 
<212> DNA 

<213> Artificial Sequence • 

i 

<220> 

<223> - Description of Artificial Sequence : ribosome 
binding site 

<400> 53 

ttagaaagga ggtgatttaa atg 23 

<210> 54 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 54 

ttagaaagga ggtgtttaaa atg 23 

<210> 55 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

<400> 55 

attcgagaaa ggaggtgaat ataatatg 28 
<210> 56 
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i ' 

i » i ■ 

<211> 27 

( <212> DNA , ( . 

<213>. Artificial Sequence ' ' ' , • , ' 1 

<220> . ' , 

<223> Description of Artificial Sequence : ribosome 
binding site • 

r , • 

t 

<400> 56 ' . ' .'■ 

attcgagaaa ggaggtgaat aataatg 27 

<210> 57 
<211> 28 
' <212> DNA 

<213> Artificial Sequence | , 

<220> 

<223> Description of Artificial Sequence : ribosome 
binding site 

i 

<400> 57 

attcgtagaa aggaggtgaa ttaatatg 28 

i , 



<210> 58 
<211> 3291 
<212> DNA 

<213> Bacillus subtilis 
<400> 58 



atggggacta 


atgtacaggt 


ggattcagca 


tctgccgaat 


gtacacagac 


gatgagcgga 


60 


gcattaatgc 


tgattgaatc 


attaaaaaaa 


gagaaagtag 


aaatgatctt 


cggttatccg 


120 


ggcggggctg 


tgcttccgat 


ttacgataag 


ctatacaatt 


cagggttggt 


acatatcctt 


180 


ccccgtcacg 


aacaaggagc 


aattcatgca 


gcggagggat 


acgcaagggt 


ctccggaaaa 


240 


ccgggtgtcg 


t cat tgccac 


gtcagggccg 


ggagcgacaa 


accttgttac 


aggccttgct 


300 


gatgccatga 


ttgatt catt 


gccgt tagtc 


gtct ttacag 


ggcaggtagc 


aacctctgta 


360 


atcgggagcg 


atgcat ttca 


ggaagcagac 


attttaggga 


ttacgatgcc 


agtaacaaaa 


420 


cacagctacc 


aggttcgcca 


gccggaagat 


ctgccgcgca 


tcattaaaga 


agcgttccat 


480 


attgcaacaa 


ctggaagacc 


cggacctgta 


ttgattgata 


ttccgaaaga 


tgtagcaaca 


540 


attgaaggag 


aattcagcta 


cgatcatgag 


atgaatctcc 


cgggatacca 


gccgacaaca 


600 


gagccgaatt 


atttgcagat 


ccgcaagctt 


gtggaagccg 


tgagcagtgc 


gaaaaaaccg 


660 


gtgatcctgg 


cgggtgcggg 


cgtactgcac 


ggaaaagcgt 


cagaagaatt 


aaaaaattat 


720 


gctgaacagc 


agcaaatccc 


tgtggcacac 


acccttttgg 


ggctcggagg 


cttcccggct 


780 


gaccatccgc 


ttttcctagg 


gatggcggga 


atgcacggta 


cttatacagc 


caatatggcc 


840 
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* f t ' 

l t 

cttcatgaat gtgatctatt aatcagtatc ggcgcccgtt' ttgat'gaccg tgtcacaggai 90D 

* i r if 

* , i 

aacctgaaac actttgccag aaacgcaaag- atagcccaca tcgatattga tccagcttjaa 960 

i 

atcggaaaaa tcatgaaaac acagattcct gtagtcggag acagcaaaat tgtcctgcag 1020 

gagctgatca aacaagacgg caaacaaagc gattcaagc^g aatggaaaaa acagctcgca 1080 

'gaatggaaag aagagtatcc gctctggtat gtagataatg aagaagaagg ttttaaacct 1140 

cagaaattga ttgaatatat tcatcaattt acaaaaggag aggccattgt cgcaacggat 1200 

■ j ^ 

i • 

gtaggccagc atcaaatgtg gtcagcgcaa ttttatccgt tccaaaaagc agataaatgg 1260 

I i 1 > 

i 

i i 

gtcacgtcag gcggacttgg aacgatggga ttcggtcttc cggcggcgat cggcgcacag 1320 
ctggccgaaa aagatgctac tgttgtcgcg gttgtcggag acggcggatt ccaaatgacg 1380 

! t 

cttcaagaac tcgatgttat tc'gcgaatta aatcttccgg tcaaggtagt gattttaaat 1440 

aacgcttgtc tcggaatggt cagacagtgg caggaaattt tctatgaaga acgttattca 1500 

gaatctaaat tcgcttctca gcctgacttc gtcaaattgt ccgaagcata cggcattaaa 1560 

i 

ggcatcagaa tttcatcaga agcggaagca aaggaaaagc tggaagaggc attaacatca 1620 

agagaacctg ttgtcattga cgtgcgggtt gccagcgaag aaaaagtatt cccgatggtg 1680 

gctccgggga aagggctgca tgaaatggtg ggggtgaaac cttgaaaaga attatcacat 1740 

tgactgtggt gaaccgctcc ggggtgttaa accggatcac cggtctattc acaaaaaggc 1800 

attacaacat tgaaagcatt acagttggac acacagaaac agccggcgtt tccagaatca 1860 

ccttcgtcgt tcatgttgaa ggtgaaaatg atgttgaaca gttaacgaaa cagctcaaca 1920 

aacagattga tgtgctgaaa gtcacagaca tcacaaatca atcgattgtc cagagggagc 1980 

tggccttaat caaggttgtc tccgcacctt caacaagaac agagattaat ggaatcatag 2040 

aaccgtttag agcctctgtc gttgatgtca gcagagacag catcgttgtt caggtgacag 2100 

gtgaatctaa caaaattgaa gcgcttattg agttattaaa accttatggc attaaagaaa 2160 

tcgcgagaac aggtacaacg gcttttgcga ggggaaccag caaaaggcgt catccaataa 2220 

aacaatatct attgtataaa acataacaag ggagagattg aaatggtaaa agtatattat 2280 

aacggtgata tcaaagagaa cgtattggct ggaaaaacag tagcggttat cgggtacggt 2340 

tcgcaaggcc acgcacatgc cctgaacctt aaagaaagcg gagtagacgt gatcgtcggt 2400 

gttagacaag gaaaatcttt cactcaagcc caagaagacg gacataaagt attttcagta 2460 

aaagaagcgg cagcccaagc cgaaatcatc atggttctgc ttccggatga gcagcagcaa 2520 

aaagtatacg aagctgaaat caaagatgaa ttgacagcag gaaaatcatt agtattcgct 2580 
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catggattta acgt:gcattt ccatcaaatt igttcctccgg eggatgtaga tgtattct'ta 2640 
gtggccccta aaggcccggg acacttggta agaagaacat atgagcaagg agctggcgta 2700 

i 1 

i 

i 

cctgcattgt tcgcaatcta tcaagatgtg actggagaag caagagacaa agccctcgct 2760 
tatgctaaag gaatcggcgg cgcaagagcg ggcgtattag aaacgacatt taaagaagaa 2820 

acagaaacag atttgttcgg tgagcaagca gttctttgcg gcggattaag cgcgcttgtc 2880 

\ ■ i ' . ' 

aaagccgcijat ttgaaacctt aactgaagca ggttatcagc ctgaacttgc atacttcgag 2940 

■ i' 
i 

tgtcttcatg agctgaaatt aat cgtagac . cttatgtacg aagaaggact tgcaggaatg 3000 

t 

agatattcaa tctctgacac agcacagtgg ggagatttcg tatcaggccc tcgcgttgtg 3060 

' ■ 

i ■ ■ 

gacgccaaag taaaagaatc tatgaaaga'a gtatt,aaaag atatccaaaa cggtacattc 3120 

gcaaaagagt ggatcgtcga aaaccaagta aaccgtcctc gtttfcaacgc tatcaatgca 3180 

i 

agcgagaacg aacatcaaat cgaagtagtg ggaagaaagc ttcgtgaaat gatgccgttt 3240 
gtgaaacaag gcaagaagaa ggaagcggtg gtctccgttg cgcaaaatta a 3291 

* 

<210> 59 
<211> 2363 
<212> DNA 

<213> Bacillus subtilis 

<220> 
<221> CDS 

<222> (242) . . (1072) 

<220> ■ 
<221> CDS 

<222> (1077) . . (1934) 

<220> 
<221> CDS 

<222> (1939) . . (2319) 
<400> 59 

ttggtacaag cccgttgatt ttggtatact tccattgggc agtatcgcct gcgaactgca 60 

cctattatta aaatagatag acattgcagc agtctgcctt gatccaaaaa aggactggga 120 

cagagggatg aaactcgccg aactttagaa agtgaagaat ccttctcgtt gtaacggaag 180 

gttttttggc ttgcagaaga aaacggcaga tcatctcctc taaacatgag gaggagaaaa 240 

c atg aaa aca aaa ctg gat ttt eta aaa atg aag gag tct gaa gaa ccg 289 
Met Lys Thr Lys Leu Asp Phe Leu Lys Met Lys Glu Ser Glu Glu Pro 
15 10 15 

att gtc atg ctg acc get tat gat tat ccg gca get aaa ctt get gaa 337 
lie Val Met Leu Thr Ala Tyr Asp Tyr Pro Ala ftla Lys Leu Ala Glu 
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20 



25 



30 



caa gcg gga gtt gac atg att tta gtc ggt gat tea 1 ctt gga atg 1 gt:c 
Gin Ala Gly Val Asp Met lie Leu Val Gly Asp' Se,r Leu Gly Met .Vatl 
35 40 45 



385 



gtc etc ggc ctt gat tea act gtc ggt gtg aca gtt gcg gac atg ate 433 
Val Leu Gly Leu Asp Ser Thr. Val Gly Val Thr Val 1 Ala Asp Met lie 
50 55 60 

cat cat aca aaa gec gtt aaa agg ggt gcg ccg aat acc ttt att gtg 4 81 
His His Thr Lys Ala Val Lys Arg Gly Ala Pro Asn Thr Phe lie Val 



65 



70 



75 



80 



aca gat atg ccg ttt atg tct tat cac ctg tct aag gaa gat acg ctg 
Thr Asp Met Pro Phe Met Ser : Tyr His Leu Ser Lys Glu Asp Thr Leu 

85 90 95 



529 



aaa aat 
Lys Asn 



gca 
Ala 



gcg 
Ala 
100 



get ate gtt cag 
Ala lie Val Gin 



gaa 
Glu 
105 



age 
Ser 



gga 
Gly 



get 
Ala 



gac 
Asp 



gca 
Ala 
110 



ctg 
Leu 



aag 
Lys 



577 



ctt gag ggc gga gaa ggc gtg ttt gaa tec att cgc gca ttg acg ctt: 
Leu Glu Gly Gly Glu Gly Val Phe Glu Ser lie Arg Ala Leu Thr Leu 
115 120 125 



625 



gga ggc att cca 
Gly Gly lie Pro 
130 



gta gtc agt cac tta ggt ttg aca ccg cag tea gtc 
Val Val Ser His Leu Gly Leu Thr Pro Gin Ser Val 
135 140 



673 



ggc gta 
Gly Val 
145 



ctg ggc ggc tat aaa gta 
Leu Gly Gly Tyr Lys Val 

150 



cag ggc aaa gac gaa caa age gec 
Gin Gly Lys Asp Glu Gin Ser Ala 
155 160 



721 



aaa aaa tta ata 
Lys Lys Leu lie 



gaa gac agt ata aaa tgc gaa gaa gca gga get atg 
Glu Asp Ser lie Lys Cys Glu Glu Ala Gly Ala Met 
165 170 175 



769 



atg 
Met 



ctt 
Leu 



gtg 
Val 



ctg 
Leu 
180 



gaa 
Glu 



tgt 
Cys 



gtg 
Val 



ccg 
Pro 



gca 
Ala 
185 



gaa 
Glu 



etc 
Leu 



aca 
Thr 



gee 
Ala 



aaa 
Lys 
190 



att 
lie 



gec 

Ala 



817 



gag 
Glu 



acg 
Thr 



eta 
Leu 
195 



age 
Ser 



ata 
He 



ccg 
Pro 



gtc 
Val 



att 
He 
200 



gga 
Gly 



ate 
He 



ggg 

Gly 



get 
Ala 



ggt 
Gly 
205 



gtg 
Val 



aaa 
Lys 



gcg 
Ala 



865 



gac gga caa gtt etc gtt tat cat gat att ate ggc cac 
Asp Gly Gin Val Leu Val Tyr His Asp He He Gly His 
210 215 220 



ggt gtt gag 
Gly Val Glu 



913 



aga aca cct aaa ttt gta aag caa tat acg cgc att gat gaa 
Arg Thr Pro Lys Phe Val Lys Gin Tyr Thr Arg lie Asp Glu 
225 230 235 



acc ate 
Thr He 
240 



961 



gaa aca gca ate age gga tat gtt cag gat gta aga cat cgt get ttc 
Glu Thr Ala He Ser Gly Tyr Val Gin Asp Val Arg His Arg Ala Phe 

245 250 255 



1009 
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cct gaa caa aag cat tec ttt caa atg aac cag aca gtg ctt gac ggc 1057 

Pro Glu Gin Lys pis Sex Phe Gin Met Asn Gin Thr Val Leu Asp Gly 

260. 1 ■ 265 i r ' 270' 

■ i - i 

1 • . 1 l , ' 1 

ttg tac ggg gga aaa taag atg aga cag att act gat att tea cag ctg 1106 
Leu Tyr Gly Gly Lys Met Arg Gin lie Thr Asp lie $er Gin Leu 

275 280 '. 285 

i ■■ , 

aaa gaa gec ata aaa caa tac cat tea gag ggc aag tea ate gga ttt 1154 

Lys Glu Ala lie Lys Gin Tyr His Ser Glu Gly Lys Ser ,1-1 e Gly Phe 

290 295, ' 300 

' i • . . 

gtt ccg acg at f g ggg ttt ctg cat gag ggg cat tta ace tta gca gac 1202 

Val Pro Thr Met Gly Phe Leu His Glu Gly His Leu Thr Leu Ala Asp 

305 310 . 315 1 

i 

aaa gca aga caa gaa aac gac gec gtt att atg agt att ttt gtg aat 1250 

Lys Ala Arg Gin Glu Asn Asp Ala Val lie Met Ser lie Phe Val Asn 

320 325 330 .335 



i 



■i 



cct gca caa ttc ggc cct aat gaa gat ttt gaa gca tat ccg cgc gat 1298 

Pro Ala Gin Phe Gly Pro Asn Glu Asp Phe Glu Ala Tyr Pro, Arg Asp 

340 345 350 

att gag egg gat gca get ctt gca gaa aac gec gga gtc gat att ctt , 1346 

lie Glu Arg Asp Ala Ala Leu Ala Glu Asn Ala Gly Val Asp lie Leu 

355 ' 360 365 

i 

ttt acg cca gat get cat gat atg tat ccc ggt gaa aag aat gtc acg 1394 

Phe Thr Pro Asp Ala His Asp Met Tyr Pro Gly Glu Lys Asn Val Thr 
370 375 380 

att cat gta gaa aga cgc aca gac gtg tta tgc ggg cgc tea aga gaa 1442 

He His Val Glu Arg Arg Thr Asp Val Leu Cys Gly Arg Ser Arg Glu 
385 390 395 

gga cat ttt gac ggg gtc gcg ate gta ctg acg aag ctt ttc aat eta 1490 

Gly His Phe Asp Gly Val Ala He Val Leu Thr Lys Leu Phe Asn Leu 

400 405 410 415 

gtc aag ccg act cgt gee tat ttc ggt tta aaa gat gcg cag cag gta 1538 

Val Lys Pro Thr Arg Ala Tyr Phe Gly Leu Lys Asp Ala Gin Gin Val 

420 425 430 

get gtt gtt gat ggg tta ate age gac ttc ttc atg gat att gaa ttg 1586 

Ala Val Val Asp Gly Leu lie Ser Asp Phe Phe Met Asp He Glu Leu 

435 440 445 

gtt cct gtc gat acg gtc aga gag gaa gac ggc tta gee aaa age tct 1634 

Val Pro Val Asp Thr Val Arg Glu Glu Asp Gly Leu Ala Lys Ser Ser 
450 455 460 

cgc aat gta tac tta aca get gag gaa aga aaa gaa gcg cct aag ctg 1682 

Arg Asn Val Tyr Leu Thr Ala Glu Glu Arg Lys Glu Ala Pro Lys Leu 
465 470 475 

tat egg gec ctt caa aca agt gcg gaa ctt gtc caa gee ggt gaa aga 1730 

Tyr Arg Ala Leu Gin Thr Ser Ala Glu Leu Val Gin Ala Gly Glu Arg 
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480 , , 485 , , 490 ' 495 

1 i . ' ' ' , . 

gat cct gaa gcg gtg ata aaa get gca aaa gat ate att gaa acg act 1778 
Asp Pro Glu ' Ala Val lie Lys Ala Ala Lys , Asp lie .lie Glu Thr Thr 
\ '500 505 510 

* 

age gga acc ata gac tat gta gag ctt tat tec tat ccg gaa etc gag 18,26 
Ser Gly Thr lie TVsp Tyr Val Glu Leu Tyr ' Ser Tyr Pro Glu Leu Glu 

515 520 , , • ' 525 

. '. ■ . ' 

cct gtg aat gaa att get gga a'.ag ! atg att etc get gtt gca gtt get 1874 
Pro Val' Asn Glu lie Ala Gly Lys Met lie Leu Ala Val Ala' Val Ala 
' 530 i 535 ' , 540 

i 

ttt tea aaa gcg cgt tta ata gat aat a'tc , att att gat att cga gaa 1922 
Phe ^Ser Lys Ala Arg Leu lie Asp Asn lie lie lie Asp lie Arg Glu 

545 550 555 

i 

atg gag aga ata taat atg tat cga aca atg atg age ggc aaa ctt cac 1971 
Met Giu 1 Arg lie' Met Tyr Arg Thr Met Met Ser Gly Lys Leu His 

560 565 570 

agg gca act gtt acg gaa gca aac ctg aac tat gtg gga age att aca 2019 
Arg Ala Thr Val Thr Glu Ala Asn Leu Asn Tyr Val Gly Ser lie Thr 
575 580 585 590 

att gat gaa gat etc att gat get gtg gga atg ctt cct aat gaa aaa 2067 
lie Asp Glu Asp Leu lie Asp Ala Val Gly Met Leu Pro Asn Glu Lys 

595 600 605 

gta caa att gtg aat aat aat aat gga gca cgt ctt gaa acg tat att 2115 
Val Gin lie Val Asn Asn Asn Asn Gly Ala Arg Leu Glu Thr Tyr lie 

610 615 620 

att cct ggt aaa egg gga age ggc gtc ata tgc tta aac ggt gca gee 2163 
lie Pro Gly Lys Arg Gly Ser Gly Val lie Cys Leu Asn Gly Ala Ala 
625 630 635 

gca cgc ctt gtg cag gaa gga gat aag gtc att att att tec tac aaa 2211 
Ala Arg Leu Val Gin Glu Gly Asp Lys Val lie lie lie Ser Tyr Lys 
640 645 650 

atg atg tct gat caa gaa gcg gca age cat gag ccg aaa gtg get gtt 2259 
Met Met Ser Asp Gin Glu Ala Ala Ser His Glu Pro Lys Val Ala Val 
655 660 665 670 

ctg aat gat caa aac aaa att gaa caa atg ctg ggg aac gaa cca gee 2307 
Leu Asn Asp Gin Asn Lys lie Glu Gin Met Leu Gly Asn Glu Pro Ala 

675 680 685 

cgt aca att ttg tagaagaaaa gcccccttta tcgggggttt tcttttaaga tttt 2363 
Arg Thr lie Leu 

690 



<210> 60 
<211> 293 
<212> PRT 
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<213> Bacillus subtilis 



> i 



! 



<400> 60 i f 

Met Ser lie Ala Val Ser Glu Glu Glu Ala Lys Ala Val Glu Gly Leu 

1 5 ' ' 10 , 1 . 15 ' 



Asn Asp Tyr Leu Ser Val Glu* Glu Val Glu Thr lie Tyr lie Pro Leu 

20 ,25 30 



, Val Arg Leu Leu His Leu His 

35 



Val Lys Ser Ala Ala Glu Arg Asn Lys 



4.0 



45 



His Val Asn Val Phe Leu Lys! His Pro His Ser Ala Lys lie Pro Phe 



50 



55 



60 



i.' 



lie lie Gly lie Ala Gly Ser: Val Ala Val Gly Lys Ser Thr Thr Ala 

65 70 75 80 

Arg lie Leu Gin Lys Leu Leu Ser Arg Leu Pro Asp Arg Pro Lys Val 

85 , 90 95 



Ser Leu lie Thr Thr Asp Gly 

100 



Phe Leu Phe Pro Thr Ala Glu Leu Lys 
! 105 110 



Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu Ser Tyr Asp Val 
115 120 125 

Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser Gly Lys Asp Ser 
130 135 140 

Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp Arg Glu Glu Gly 
145 150 155 160 

Val Phe Glu Val Val Glu Gin Ala Asp lie Val lie lie Glu Gly lie 

165 170 175 

ft 

Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg Glu Asn Pro Arg 

180 185 190 

lie Phe Val Ser Asp Phe Phe Asp Phe Ser lie Tyr Val Asp Ala Glu 
195 200 205 

Glu Ser Arg lie Phe Thr Trp Tyr Leu Glu Arg Phe Arg Leu Leu Arg 
210 215 220 

Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His Lys Phe Lys Asp 
225 230 235 240 

Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser lie Trp Glu Ser 

245 250 255 

Val Asn Arg Pro Asn Leu Tyr Glu Asn lie Leu Pro Thr Lys Phe Arg 

260 265 270 



Ser Asp Leu lie Leu Arg Lys Gly Asp Gly His Lys Val Glu Glu Val 
275 280 285 



Leu Val Arg Arg Val 
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290 



<210> 61 
<211> 281 
<212> PRT 

<213> Bacillus subtilis 
<400> 61 

'Met Glu Gly Leu Asn Asp Tyr 

1 ' . 5 



Leu Ser Val Glu Glu Val Glu Thr lie, 
10 ,• ' 15 



Tyr lie Pro Leu Val Arg Leu Leu His Leu His Val Lys Ser Ala Ala 

20 25 30 

I i' » 

i " i- 

Glu Arg Asn Lys His Val Asn : Val Phe Leu Lys His Pro His Ser Ala 
35 40 45, 

Lys lie Pro Phe He He Gly* lie Ala Gly Ser Val Ala Val Gly Lys 

50 1 55 ; 60 

Ser Thr Thr Ala Arg lie Leu I Gin Lys Leu Leu Ser Arg Leu Pro Asp 
65 70 ! 75 8Q 

Arg Pro Lys Val Ser Leu lie Thr Thr Asp Gly Phe Leu Phe Pro Thr 

85 90 95 

Ala Glu Leu Lys Lys Lys Asn Met Met Ser Arg Lys Gly Phe Pro Glu 

100 105 110 

Ser Tyr Asp Val Lys Ala Leu Leu Glu Phe Leu Asn Asp Leu Lys Ser 
115 . 120 125 

Gly Lys Asp Ser Val Lys Ala Pro Val Tyr Ser His Leu Thr Tyr Asp 
130 135 140 

Arg Glu Glu Gly Val Phe Glu Val Val Glu Gin Ala Asp He Val He 
145 150 155 160 

He Glu Gly He Asn Val Leu Gin Ser Pro Thr Leu Glu Asp Asp Arg 

165 170 175 

Glu Asn Pro Arg He Phe Val Ser Asp Phe Phe Asp Phe Ser lie Tyr 

180 185 190 

Val Asp Ala Glu Glu Ser Arg He Phe Thr Trp Tyr Leu Glu Arg Phe 
195 200 205 

Arg Leu Leu Arg Glu Thr Ala Phe Gin Asn Pro Asp Ser Tyr Phe His 
210 215 220 

Lys Phe Lys Asp Leu Ser Asp Gin Glu Ala Asp Glu Met Ala Ala Ser 
225 230 235 240 

He Trp Glu Ser Val Asn Arg Pro Asn Leu Tyr Glu Asn He Leu Pro 

245 250 255 

Thr Lys Phe Arg Ser Asp Leu He Leu Arg Lys Gly Asp Gly His Lys 
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260 



265 



' 270 



Val Glu Glu Val' Leu Val Arg Ar^ Val 
275 " 280 



<210> 62 

<211> 1092 , 
<212> DNA 

<213> Bacillus subtilis 



<220> 

<221> 'CDS ( 
<222> (1) . . (1089) 



<400> 62 

atg act aaa caa aca att cgc gtt gaa ttg aca tea aca aaa aaa ccg 

Met Thr Lys Gin Thr lie Arg Vai Glu Leu Thr Ser Thr Lys Lys Pro 
1 5 10 15. 



48 



aaa cca gac cca aat cag ctt teg ttc gga aga gtg ttt aca gac cac 
Lys Pro Asp Pro Asn Gin Leu Ser. Phe Gly Arg Val Phe Thr Asp His 

20 25 30 



96 



atg ttt gta atg gac tat gec gca gat aaa ggt tgg tac gat cca aga 
Met Phe Val Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg 
35 40 45 



144 



ate att cct tat caa ccc tta tea atg gat cca act gca atg gtc tat 
lie lie Pro Tyr Gin Pro Leu Ser Met Asp Pro Thr Ala Met Val Tyr 
50 55 60 



192 



cac tac ggc caa ace gtg ttt gaa ggg tta aag get tac gtg tea gag 
His Tyr Gly Gin Thr Val Phe Glu Gly Leu Lys Ala Tyr Val Ser Glu 
65 70 75 80 



240 



gat gac cat gtt ctg ctt ttc aga ccg gaa aaa aat atg gaa cgc ctg 
Asp Asp His Val Leu Leu Phe Arg Pro Glu Lys Asn Met Glu Arg Leu 

85 90 95 



288 



aat caa tea aac gac cgc etc tgc ate ccg caa att gat gaa gaa cag 
Asn Gin Ser Asn Asp Arg Leu fys lie Pro Gin lie Asp Glu Glu Gin 

100 105 110 



336 



gtt ctt gaa ggc tta aag cag ctt gtc gca att gat aaa gac tgg att 
Val Leu Glu Gly Leu Lys Gin Leu Val Ala lie Asp Lys Asp Trp lie 
115 120 125 



384 



cca aat gcg gag ggc acg tec ctt tac ate cgt ccg ttc ate ate gca 
Pro Asn Ala Glu Gly Thr Ser Leu Tyr lie Arg Pro Phe lie lie Ala 
130 135 140 



432 



acc gag cct ttc ctt ggt gtt gcg gca tct cat acg tat aag etc ttg 
Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu 
145 150 155 160 



480 



ate att ctt tct ccg gtc ggc tct tat tac aaa gaa ggc att aag ccg 
lie lie Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly lie Lys Pro 



528 
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, , 165 , ,170 ' 175 



i 



gtc aaa ate get gtt gaa agt gaa ttt gtc cijt gcg gta aaa ggc gga 576 
Val Lys lie' Ala Val Glu Ser Glu Phe Val, Arg Ala .Vai Lys'dy Gly 

180 1 185 190 

< 

aca gga aat gec aaa acc gca gga aac tat get tea age tta aaa gcg 624 
Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala 

195 200 , ■ 1 " 205 i 

► r 

cag cag, gta gee gaa gag aaa gga 1 ttt tct caa gta etc tgg ctg gac 672 
Gin Gin Val Ala Glu Glu Lys Gly Phe Ser Gin Val Leu Trp ■ Leu Asp 
210 i 215 , 220 , 

ggc att gag aag aaa tac ate gaa gaa gtc. gga age atg aac ate ttc 720 
Gly lie Glu Lys Lys Tyr lie Glu Glu Val. Gly Ser Met Asn lie Phe 
225 ' 230 / 235 240 

ttc aaa ate aac ggt gaa ate gta aca ccg atg ctg aac ggg age ■ ate 768 
Phe Lys lie Asn Gly Glu lie Val Thr Pro Met Leu Asn Gly Ser lie 

245 250 255 

ctg gaa ggc att acg cgc aat tea gtc ate gec ttg ctt aag cat tgg 816 
Leu Glu Gly lie Thr Arg Asn Ser Val lie Ala Leu Leu Lys His Trp 

260 265 270 

ggc ctt caa gtt tea gaa cga aaa att gcg ate gat gag gtc ate caa 864 
Gly Leu Gin Val Ser Glu Arg Lys lie Ala lie Asp Glu Val lie Gin 
275 280 285 

gee cat aaa gac ggc ate ctg gaa gaa gec ttc gga aca ggt aca gca 912 
Ala His Lys Asp Gly He Leu Glu Glu Ala Phe Gly Thr Gly Thr Ala 
290 295 300 

get gtt att tec cca gtc ggc gag ctg ate tgg cag gat gaa aca ctt 960 
Ala Val He Ser Pro Val Gly Glu Leu He Trp Gin Asp Glu Thr Leu 
305 310 315 320 

teg ate aac aac ggt gaa aca gga gaa ate gca aaa aaa eta tat gac 1008 
Ser He Asn Asn Gly Glu Thr Gly Glu He Ala Lys Lys Leu Tyr Asp 

325 330 335 

acg att aca ggc att caa aaa ggc get gtc gca gac gaa ttc gga tgg 1056 
Thr He Thr Gly He Gin Lys Gly Ala Val Ala Asp Glu Phe Gly Trp 

340 345 350 

acg acc gaa gtc gca gcg ctg act gaa age aag taa 1092 
Thr Thr Glu Val Ala Ala Leu Thr Glu Ser Lys 
355 360 



<210> 63 
<211> 363 
<212> PRT 

<213> Bacillus subtilis 



<400> 63 

Met Thr Lys Gin Thr lie Arg Val Glu Leu Thr Ser Thr Lys Lys Pro 
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10 



15 



Lys Pro Asp 



Pro Asn 
20 



Gin Leu Ser Phe Gly Arg Val* Phe Thr Afep.His 

.25 ' * . * 30 , 



Met Phe Val 
35 



Met Asp Tyr Ala Ala Asp Lys Gly Trp Tyr Asp Pro Arg 

' 40 45 



lie lie Pro 
50 

His Tyr Gly 
65 



Tyr Gin Pro Leu Ser Met Asp Pro. Thr Ala Met Val Tyr 



Gin Thr 



55 

Val Phe 

70 I 



60 



Glu Gly Leu Lys Ala Tyr Val 

75 



Ser Glu 
80 



Asp Asp His 



Val Leu 
85 



Leu Phe: Arg Pro Glu Lys Asn Met Glu 

! 90 



Arg 1 Leu 
95 ' 



Asn Gin Ser 



Asn Asp 
100 



Arg Leu Cys lie Pro Gin lie Asp Glu Glu Gin 

105 110 



Val Leu Glu 
115 

Pro Asn Ala 
130 



Gly Leu Lys Gin Leu Val Ala lie Asp Lys Asp Trp lie 

I 120 125 



Glu Gly Thr Ser 1 Leu Tyr lie Arg Pro Phe lie 

135 140 



lie Ala 



Thr Glu Pro Phe Leu Gly Val Ala Ala Ser His Thr Tyr Lys Leu Leu 

145 150 155 160 

lie lie Leu Ser Pro Val Gly Ser Tyr Tyr Lys Glu Gly lie Lys Pro 

165 170 175 

Val Lys lie Ala Val Glu Ser Glu Phe Val Arg Ala Val Lys Gly Gly 

180 185 190 

Thr Gly Asn Ala Lys Thr Ala Gly Asn Tyr Ala Ser Ser Leu Lys Ala 

195 200 205 

Gin Gin Val Ala Glu Glu Lys Gly Phe Ser Gin Val Leu Trp Leu Asp 

210 215 0 220 

Gly lie Glu Lys Lys Tyr lie Glu Glu Val Gly Ser Met Asn lie Phe 

225 230 235 240 

Phe Lys lie Asn Gly Glu lie Val Thr Pro Met Leu Asn Gly Ser He 

245 250 255 

Leu Glu Gly He Thr Arg Asn Ser Val He Ala Leu Leu Lys His Trp 

260 265 270 

Gly Leu Gin Val Ser Glu Arg Lys He Ala He Asp Glu Val He Gin 

275 280 285 



Ala His Lys Asp Gly He Leu Glu Glu Ala Phe Gly Thr Gly Thr Ala 
290 295 300 



Ala Val He Ser Pro Val Gly Glu Leu He Trp Gin Asp Glu Thr Leu 
305 310 315 320 
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I 

I 

I 

* I ■ , . .'.,»!, ., ,. .... I 

Ser lie Asn Asri Gly Glu t Thr Gly Glu lie Ala Lys Ly^ Leu, Tyr Asp 

325 ' 330 1 • 335 

i • i 

Thr lie Thr Gly He Gin Lys Gly Ala Val Ala Asp Glu Phe Gly Trp 

340 345 350 

Thr Thr Glu Val ^la A!la Leu Thr Glu Ser Lys 

355 , 360 , . ' " , 

* i ■. ■ ' 

<210> 64 ' - 

<211> 1071 t 
<212> DNA 

<213> Bacillus subtilis 
<220> 

<221> CDS ' ■ ' 

<222> (1) • . (1068) , 

<400> 64 , 
ttg aat aag ctt att gaa cga ; gaa aaa act gta tat tat aag 1 gaa aag 48 
Met Asn Lys Le,u lie Glu Arg Glu Lys Thr Val Tyr Tyr Lys Glu Lys 
1 5 10 15 

ccc gac ccg tct tec ttg ggg ttt gga caa tat ttt aca gat tat atg 96 
Pro Asp Pro Ser Ser Leu Gly Phe Gly Gin Tyr Phe Thr Asp Tyr Met 

20 25 30 

ttt gtg atg gac tac gaa gag ggg att gga tgg cat cat ccg aga att 144 
Phe Val Met Asp Tyr Glu Glu Gly lie Gly Trp His His Pro Arg lie 
35 40 45 

gcg ccg tac gca ccg ctt acg ctt gat ccg tct tea tct gtt ttt cat 192 
Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His 
50 ' 55 60 

tac ggc cag get gtt ttt gaa gga tta aaa gca tac aga aca gac gac 240 
Tyr Gly Gin Ala Val Phe Glu Gly Leu Lys Ala Tyr Arg Thr Asp Asp 
65 70 75 80 

ggc agg gtg ctg ctg ttc cgt ccg gat caa aat ate aaa egg ctg aae 288 
Gly Arg Val Leu Leu Phe Arg Pro Asp Gin Asn lie Lys Arg Leu Asn 

85 90 95 

aga teg tgt gag cgc atg age atg ccc cct tta gac gaa gag ctg gtg 336 
Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu Glu Leu Val 

100 105 110 

ctt gag gca ttg acg caa tta gtt gag ctg gag aaa gat tgg gtt cca 384 
Leu Glu Ala Leu Thr Gin Leu Val Glu Leu Glu Lys Asp Trp Val Pro 
115 120 125 

aag gaa aaa gga acg tea ctg tat att cgt cct ttt gtc att gee aca 432 
Lys Glu Lys Gly Thr Ser Leu Tyr lie Arg Pro Phe Val lie Ala Thr 
130 135 140 

gaa ccg agt etc ggt gtg aag gca tec agg age tat aca ttt atg ate 480 
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Glu Pro Ser 
145 



Leu Gly 



Val 
150 



Lys Ala Ser Arg 



Ser Tyr Thr Phe Met lie 

155. 1 ' » 1 ' '160' 



gtg ctt teg cct 
Val Leu Ser Pro 



gtc ggc t,cc tat tat ggc 
Val Gly Ser Tyr Tyr Gly 
165 170 



gac ' gat cag ctg aag peg 
Asp Asp Gin Leu Lys Pro 

175 



528 



gtt aga ate 
Val Arg lie 



gtc ggg ttt 
Val. Gly Phe 
195 

cag egg aaa 
Gin Arg Lys 
210 



tat gtc gaa gat t gag tat gtg 
Tyr Val Glu Asp Glu Tyr Val 
180 185 

i 

gca aaa acg get gga aac tat 
Ala Lys Thr Ala j Gly Asn Tyr 

200 



gcg aat gaa ctg ! ggc tat gac 
Ala Asn Glu Leu Gly Tyr Asp 

215 



agg gcg 
Arg .Ala 



gee gee 
Ala Ala 



cag gta 
Gin Val 
220 



gtc aac 
Val Asn 
i90 

I 

t 

agt ctt 
Ser Leu 
205 

ctg tgg 
Leu Trp 



gga gga 57 6 
Gly Gly 



cag gca 624 
Gin Ala 



ctg gac 672 
Leu Asp 



gec ate gaa aag 
Ala lie Glu Lys 
225 

ttc gtc ata aac 
Phe Val lie Asn 



aaa 
Lys 



ggg 

Gly 
245 



tat 
Tyr 
230 

gaa 
Glu 



gtg gaa gaa gta 
Val Glu Glu Val 



get gtc aca cct 
Ala Val Thr Pro 

250 



ggg age atg aac ate ttt 720 
Gly Ser Met Asn lie Phe 
235 240 

get tta age gga age att 768 
Ala Leu Ser Gly Ser lie 

255 



tta age ggg gtt aca cgt gcg tct gcg att gaa ttg att cga age tgg 
Leu Ser Gly Val Thr Arg Ala Ser Ala lie Glu Leu lie Arg Ser Trp 

260 265 270 



816 



ggc att ccg gtt cgt gaa gag aga ata teg att gat gag gtg tat gcg 
Gly lie Pro Val Arg Glu Glu Arg lie Ser lie Asp Glu Val Tyr Ala 
275 280 285 



8 64 



gec tct gca cgc gga gaa ttg aca gag gtc ttt ggc aca ggc acg gca 
Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr Gly Thr Ala 
290 295 300 



912 



gca gtc gtt acg cct gtc ggt gaa etc aac ate cat gga aaa acg gtg 
Ala Val Val Thr Pro Val Gly Glu Leu Asn lie His Gly Lys Thr Val 
305 310 315 320 



960 



att gta ggc gac ggg caa ate ggg gac etc teg aaa aag ctg tat gaa 
lie Val Gly Asp Gly Gin lie Gly Asp Leu Ser Lys Lys Leu Tyr Glu 

325 330 335 



1008 



acg ata aca gat att cag ctt ggc aag gta aaa ggc ccg ttt aac tgg 
Thr lie Thr Asp lie Gin Leu Gly Lys Val Lys Gly Pro Phe Asn Trp 

340 345 350 



1056 



aca gtg gaa gtg tga 
Thr Val Glu Val 
355 



1071 



<210> 65 
<211> 356 
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<212> PRT 

<213> Bacillus subtilis 



<4 00> 65 ' , •■ • ' 

Met Asn Lys Leu lie Glu Arg Glu Lys Thr Val Tyr Tyr Lys Glu Lys 
1 5 10 15 

Pro Asp Pro Ser Ser Leu Gly Phe Gly Gin Tyr Phe 1 Thr Asp Tyr Met 

20 25 .30 



Phe Val Met Asp Tyr Glu Glu 
35 



i i 

Gly lie Gly Trp> His His Pro Arg lie 
4 0 4 5 



Ala Pro Tyr Ala Pro Leu Thr Leu Asp Pro Ser Ser Ser Val Phe His 

50 55= 60 1 '■' ' 

Tyr Gly Gin Ala Val Phe Glu Gly Leu Lys Ala Tyr Arg Thr Asp Asp 

65 70 75 80 

Gly Arg Val Leu Leu Phe Arg Pro Asp Gin Asn lie Lys Arg Leu Asn 

85 90 95 



Arg Ser Cys Glu Arg Met Ser Met Pro Pro Leu Asp Glu Glu Leu Va}. 

100 105 110 

Leu Glu Ala Leu Thr Gin Leu Val Glu Leu Glu Lys Asp Trp Val Pro 
115 120 125 

Lys Glu Lys Gly Thr Ser Leu Tyr lie Arg Pro Phe Val lie Ala Thr 
130 135 140 

Glu Pro Ser Leu Gly Val Lys Ala Ser Arg Ser Tyr Thr Phe Met lie 
145 150 155 160 

Val Leu Ser Pro Val Gly Ser Tyr Tyr Gly Asp Asp Gin Leu Lys Pro 

165 170 175 

Val Arg He Tyr Val Glu Asp Glu Tyr Val Arg Ala Val Asn Gly Gly 

180 185 190 

Val Gly Phe Ala Lys Thr Ala Gly Asn Tyr Ala Ala Ser Leu Gin Ala 
195 200 205 

Gin Arg Lys Ala Asn Glu Leu Gly Tyr Asp Gin Val Leu Trp Leu Asp 
210 215 220 

Ala He Glu Lys Lys Tyr Val Glu Glu Val Gly Ser Met Asn He Phe 
225 230 235 240 

Phe Val He Asn Gly Glu Ala Val Thr Pro Ala Leu Ser Gly Ser He 

245 250 255 

Leu Ser Gly Val Thr Arg Ala Ser Ala He Glu Leu He Arg Ser Trp 

260 265 270 



Gly He Pro Val Arg Glu Glu Arg He Ser He Asp Glu Val Tyr Ala 
275 280 285 
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Ala Ser Ala Arg Gly Glu Leu Thr Glu Val Phe Gly Thr. Gly Thr Ala 
290 '!. 295 300 . 

■ . . . ' '•' " 

Ala Val Val M Thr Pro Val Gly Glu Leu Asn lie His Gly Lys < Thr Val 
305 310 ' '315 ' J . 320 

lie Val Gly Asp Gly Gin lie Gly Asp Leu Ser Lys Lys Leu Tyr Glu 

325 . 330. . ' 335 

Thr lie Thr Asp lie Gin Leu Gly Lys Val Lys Gly'. Pro E*he Asn Trp 

340 , ( ,345' 350 

Thr Val Glu Val ' 

355 1 ■ 



<210> 66 . , ' 

<211> 1428 
<212> DNA 

<213> Bacillus subtilis * 

<220> ( 

<221> CDS 

<222> (1) . . (1425) 

<400> 66 

atg tta aac ggc caa aaa gaa tat cgc gtg gaa aaa gac ttc ctt ggg 48 
Met Leu Asn Gly Gin Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly 
1 5 10 15 

gaa aaa caa att gaa gca gat gtt tat tac gga att cag acg etc cgt 96 
Glu Lys Gin lie Glu Ala Asp Val Tyr Tyr Gly lie Gin Thr Leu Arg 

20 25 30 

get tct gaa aat ttt ccg ate aca gga tac aaa ate cat gag gaa atg 144 
Ala Ser Glu Asn Phe Pro lie Thr Gly Tyr Lys lie His Glu Glu Met 
35 40 45 

att aac gca ctg gcg att gtg aaa aaa get gcg get ctt gec aac atg 192 
lie Asn Ala Leu Ala lie Val Lys Lys Ala Ala Ala Leu Ala Asn Met 
50 55 60 

gac gtg aaa egg ctg tat gaa gga att ggc caa get ate gta caa gec 240 
Asp Val Lys Arg Leu Tyr Glu Gly lie Gly Gin Ala lie Val Gin Ala 
65 70 75 80 

get gac gag att ctg gaa ggc aag tgg cac gat cag ttt ate gtc gat 288 
Ala Asp Glu lie Leu Glu Gly Lys Trp His Asp Gin Phe lie Val Asp 

• 85 90 95 

ccg att cag ggc ggt gec gga act tct atg aac atg aac gcg aat gag 336 
Pro lie Gin Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu 

100 105 110 



gtt ate gga aac egg gcg ctt gaa ate atg gga cat aaa aag gga gat 384 
Val lie Gly Asn Arg Ala Leu Glu lie Met Gly His Lys Lys Gly Asp 
115 120 125 
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tat ate cat tta agt 
Tyr lie His Leu Ser 
130 



cca aac 
Pro, Asn 
135 



aca cat 
Thr His 



,gtg aac 
Val Asri 



atg 
Met 
140 



tea cag tct cag 
Ser. Gin. Ser Gin 



432 



aac gat 
Asn Asp 
145 



gtg ttc ' ccg 
Val Phe Pro 



act 
Thr 
150 



get ate cat 
Ala lie His 



att tec aca 
lie Ser Thr 
155 ■ 



ttg aag etc 
Leu Lys Leu 



tta 
Leu 
160 



480 



gaa aaa 
Glu Lys 



ctg ctg aaa aca atg 
Leu Leu Lys Thr Met 
165 



gaa gat 
Glu Asp 



atg cat agt gtg titt 
Met His Ser Val Phe 
170 



aaa caa 
Lys Gin 
175 



528 



aaa gca 
Lys Ala 



cag ga'g ttt cac tct 
Gin Glu Phe His Ser 
180 



gtt att 
Val lie 
185 



aaa atg 
Lys Met 



ggc egg aca cac ctt 
Gly Arg Thr His Leu 
190 



576 



caa gat 
Glri Asp 

■ i 



gcg gtt ccg ate cgt 
Ala Val Pro lie Arg 
195 



ctt' ggc 
Leu Gly 
200 



cag gaa 
Gin Glu 



ttc gaa get tac age 
Phe Glu Ala Tyr Ser 
205 



teg 



" — ^ 



cgt gtt etc gag cgt gat ate aaa cga ate aag caa 
Arg Val Leu Glu Arg Asp lie Lys Arg lie Lys Gin Ser Arg Gin His 
210 , 215 220 



624 



679 



ctg tat gaa gtc aac atg ggc 
Leu Tyr Glu Val Asn Met Gly 
225 230- 



gca act get gtt ggt aca 
Ala Thr Ala Val Gly Thr 

235 



ggg ctg 
Gly Leu 



aac 
Asn 
240 



720 



get gat cct gaa tat ate aaa cag gta gta aag cac ctt get gat att 
Ala Asp Pro Glu Tyr lie Lys Gin Val Val Lys His Leu Ala Asp lie 

245 250 255 



768 



age ggg ctt cct ctt gtc ggc get gat cat ctt gtt gat gcg aca caa 
Ser Gly Leu Pro Leu Val Gly Ala Asp His Leu Val Asp Ala Thr Gin 

260 265 270 



816 



aat aca gat gee tat aca gag gta tea get tea tta aaa gtc tgc atg 
Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Met 
275 280 285 



864 



atg aac atg teg aag ate gca aac gac ctg cgc tta atg gcg 
Met Asn Met Ser Lys lie Ala Asn Asp Leu Arg Leu Met Ala 
290 295 300 



teg gga 
Ser Gly 



912 



ccg cgc gee gga ctt gcg gaa att tct ctg cct gca cgt cag 
Pro Arg Ala Gly Leu Ala Glu lie Ser Leu Pro Ala Arg Gin 
305 310 315 



ccg ggt 
Pro Gly 
320 



960 



tea tct att atg ccg ggg aaa gtc aat ccg gtt atg gcg gag ctg ate 
Ser Ser lie Met Pro Gly Lys Val Asn Pro Val Met Ala Glu Leu lie 

325 330 335 



1008 



aac caa att gcg ttc cag gtt ate gga aat gac aat aca ate tgc ctt 
Asn Gin lie Ala Phe Gin Val lie Gly Asn Asp Asn Thr lie Cys Leu 

340 345 350 



1056 



get tea gaa gee ggc cag ctt gag ttg aac gtc atg gag ccc gtg ctt 
Ala Ser Glu Ala Gly Gin Leu Glu Leu Asn Val Met Glu Pro Val Leu 



1104 
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355 



360 



365 



gtc ttt aat ttg ctt caa tec ate age ate atg aac aac ggc ttc.cgt 
Val Phe Asn Leu Leu Gin Ser lie ,Ser lie 'Met' Asri Asn Gly Phe ( Arg 
370 375 1 380 



1152 



teg ttc act gac aac tgc tta! aaa ggc att gaa gec aac gaa 
Ser Phe Thr Asp Asn Cys Leu t Lys Gly lie 'Glu Ala, Asn Glu 
385 390 . 355, 



aag 
Lys 



cgt 
Arg 
400 



'atg 
Met 



aag 
Lys 



caa 
Gin 



tac 

i 

Tyr 



gta 
Val 
405 



gaa 
Glu 



aaa 
Lys 



age gca ggc gtg ate aca' gqt 
Ser Ala Gly Val lie Thr Aia 
410 



gtc aat 
Val Asn 
415 



1200 



1248 



ccg cat ctt ggg tat gaa gcg: gca get aga att gec agg gaa 
Pro His Leu Gly Tyr Glu Ala! Ala Ala Arg lie Ala Arg Glu 

420 425 430 



gca la'tt 
Ala He 



1296 



atg aca ggg caa tct gtc egg gat ctt tgt ctg cag cat gat gtg ctg 1344 
Met Thr Gly Gin Ser Val i Arg Asp Leu Cys Leu Gin His Asp Val Leu 
435 , 440 445 

act gaa gaa gaa ttg gat att| att tta aac cca tat gag atg ace aaa 1392 
Thr Glu Glu Glu Leu Asp He lie Leu Asn Pro Tyr Glu Met Thr Lys 
450' 455. 460 



cca ggt ate gca ggg aaa gaa eta tta gaa aaa taa 
Pro Gly lie Ala Gly Lys Glu Leu Leu Glu Lys 
465 470 475 



1428 



<210> 67 
<211> 475 
<212> PRT 

<213> Bacillus subtilis 
<400> 67 

Met Leu Asn Gly Gin Lys Glu Tyr Arg Val Glu Lys Asp Phe Leu Gly 
15 10 15 

Glu Lys Gin He Glu Ala Asp Val Tyr Tyr Gly He Gin Thr Leu Arg 

20 25 30 

Ala Ser Glu Asn Phe Pro He Thr Gly Tyr Lys He His Glu Glu Met 
35 40 45 

lie Asn Ala Leu Ala He Val Lys Lys Ala Ala Ala Leu Ala Asn Met 
50 55 60 

Asp Val Lys Arg Leu Tyr Glu Gly He Gly Gin Ala He Val Gin Ala 
65 70 75 80 

Ala Asp Glu He Leu Glu Gly Lys Trp His Asp Gin Phe He Val Asp 

85 90 95 



Pro He Gin Gly Gly Ala Gly Thr Ser Met Asn Met Asn Ala Asn Glu 

100 105 110 
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Val lie Gly Asn Arg Ala Leu Glu He Met Gly His Lys Lys Gly, Asp> 

' 115 * 120 ' " 125 

Tyr lie His Leu Ser Pro Asn Thr His Val Asn Met Ser Gin Ser 'Gin 

130 135 140 

Asn Asp Val Phe Pro Thr Ala- lie His lie Ser Thr Leu Lys Leu Leu 

145 150 . 155 » ' ' 160 



Glu Lys Leu Leu Lys Thr Met 

165 



Glu Asp Met His Ser Val Phe Lys Gin 
170 ' 175 



Lys Ala Gin Glu Phe His Ser! Val lie Lys Met Gly Arg Thr His Leu 

180 1 185 190 . 



i ■ • 



Gin Asp Ala Val Pro lie Arg ; Leu Gly Gin Glu Phe Glu Ala Tyr Ser 
195 200 205 



Arg Val Leu Glu Arg Asp IleLys Arg He Lys Gin Ser Arg Gin His 
210 215 ! 220 

Leu Tyr Glu Val Asn Met Gly! Ala Thr Ala Val Gly Thr Gly Leu Asn 

225 230 ! ' . 235 24Q 



Ala Asp Pro Glu Tyr He Lys Gin Val Val Lys His Leu Ala Asp He 

245 250 255 

Ser Gly Leu Pro lieu Val Gly Ala Asp His Leu Val Asp Ala Thr Gin 

260 265 270 

Asn Thr Asp Ala Tyr Thr Glu Val Ser Ala Ser Leu Lys Val Cys Met 
275 280 285 

Met Asn Met Ser Lys He Ala Asn Asp Leu Arg Leu Met Ala Ser Gly 
290 295 300 

Pro Arg Ala Gly Leu Ala Glu He Ser Leu Pro Ala Arg Gin Pro Gly 
305 310 315 320 

Ser Ser He Met Pro Gly Lys Val Asn Pro Val Met Ala Glu Leu He 

325 330 335 

Asn Gin He Ala Phe Gin Val He Gly Asn Asp Asn Thr He Cys Leu 

340 345 350 

Ala Ser Glu Ala Gly Gin Leu Glu Leu Asn Val Met Glu Pro Val Leu 
355 360 365 

Val Phe Asn Leu Leu Gin Ser He Ser He Met Asn Asn Gly Phe Arg 
370 375 380 

Ser Phe Thr Asp Asn Cys Leu Lys Gly He Glu Ala Asn Glu Lys Arg 
385 390 395 400 

Met Lys Gin Tyr Val Glu Lys Ser Ala Gly Val He Thr Ala Val Asn 

405 410 415 



Pro His Leu Gly Tyr Glu Ala Ala Ala Arg He Ala Arg Glu Ala He 
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420 425 ■ 430 

Met Thr Gly Gin- Ser Val'Arg Asp Leu Cys Leu Gin Hi's Asp' Val Leu 
435,, 440 ' 445 ' 

I ' 

I 

Thr Glu Glu Glu Leu Asp lie lie Leu Asn Pro Tyr Glu Met Thr Lys 
450 455 460 



Pro Gly lie Ala Gly Lys Glu Leu Leu Glu: Lys 
465 470 ' 475 

i i 1 ■ 

<210> 68 ■ ■ 

<211> 768 ' 
<212> DNA , " ■ 

<213> Bacillus subtilis 



i 



<220> , ' . , 

<221> CDS 

<222> (il) . . (765) 1 

<400> 68 , 
atg aaa cga gaa age aac att caa gtg etc age cgt ggt caa aaa gat 48 
Met Lys Arg Glu Ser Asn lie Gin Val Leu Ser Arg Gly Gin Lys Asp 
15 10 15 

cag cct gtg age cag att tat caa gta tea aca atg act tct eta tta 96 
Gin Pro Val Ser Gin lie Tyr Gin Val Ser Thr Met Thr Ser Leu Leu 

' 20 25 30 

gac gga gta tat gac gga gat ttt gaa ctg tea gag att ccg aaa tat 144 
Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu lie Pro Lys Tyr 
35 40 45 

gga gac ttc ggt ate gga ace ttt aac aag ctt gac gga gag ctg att 192 
Gly Asp Phe Gly He Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu He 
50 55 60 

ggg ttt gac ggc gaa ttt tac cgt ctt cgc tea gac gga acc gcg aca 240 
Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr 
65 70 75 * 80 

ccg gtc caa aat gga gac cgt tea ccg ttc tgt tea ttt acg ttc ttt 288 
Pro Val Gin Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe 

85 90 95 

aca ccg gac atg acg cac aaa att gat gcg aaa atg aca cgc gaa gac 336 
Thr Pro Asp Met Thr His Lys He Asp Ala Lys Met Thr Arg Glu Asp 

100 105 110 

ttt gaa aaa gag ate aac age atg ctg cca age aga aac tta ttt tat 384 
Phe Glu Lys Glu He Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr 
115 120 125 

gca att cgc att gac gga ttg ttt aaa aag gtg cag aca aga aca gta 432 
Ala He Arg He Asp Gly Leu Phe Lys Lys Val Gin Thr Arg Thr Val 
130 135 140 
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gaa ctt caa gaa aaa cct tac gtg cca atg gtt gaa gcg gtc aaa aca 
Glu Leu Gin Glu' Lys Pro Tyr Val Pro Met Val Glu Ala. Val Lys ' Thr 
145 150 155 ' 160 



480 

i 



cag ccg att ttc aac ttc gac aac gtg aga gga acg att gta ggt ttc 528 
Gin Pro lie Phe Asn Phe Asp Asn Val Arg Gly Thr lie Val Gly Phe 

165 ' 170 , 175 

ttg aca cca get tat gca aac gga ate gec gtt' tct ggc, tat cac ctg 576 
Leu Thr Pro Ala Tyr Ala Asnj Gly lie Ala Val Ser Gly Tyr His Leu 

' 180 185 ' 190 ' ' 1 

cac ttc att gac gaa gga cgc| aat tea ggc gga cac gtt ttt gac tat 624 
His Phe lie Asp Glu Gly Arg : Asn Ser Gly Gly His Val Phe Asp Tyr 
195 : 200 205 1 1 ' 

gtg ctt gag gat tgc acg gtt acg att tct caa aaa atg aac atg aat 672 
Val Leu Glu Asp Cys Thr Val Thr lie Ser Gin Lys Met Asn Met Asn 
210 215 ; 220 

etc aqa ctt ccq aac aca aca aat ttc ttt aat gcg aat ctg gat aac 720 
Leu Arg Leu Pro Asn Thr Ala! Asp Phe Phe Asn Ala Asn Leu Asp Asn 
225 230 ! 235 24Q 

cct gat ttt gcg aaa gat ate gaa aca act gaa gga age cct gaa taa 768 
Pro Asp Phe Ala Lys Asp lie Glu Thr Thr Glu Gly Ser Pro Glu 

245 250 255 



<210> 69 
<211> 255 
<212> PRT 

<213> Bacillus subtilis 
<400> 69 

Met Lys Arg Glu Ser Asn lie Gin Val Leu Ser Arg Gly Gin Lys Asp 
15 10 15 

Gin Pro Val Ser Gin lie Tyr Gin Val Ser Thr Met Thr Ser Leu Leu 

20 25 30 

Asp Gly Val Tyr Asp Gly Asp Phe Glu Leu Ser Glu lie Pro Lys Tyr 
35 40 45 

Gly Asp Phe Gly lie Gly Thr Phe Asn Lys Leu Asp Gly Glu Leu lie 
50 55 60 

Gly Phe Asp Gly Glu Phe Tyr Arg Leu Arg Ser Asp Gly Thr Ala Thr 
65 70 75 80 

Pro Val Gin Asn Gly Asp Arg Ser Pro Phe Cys Ser Phe Thr Phe Phe 

85 90 95 

Thr Pro Asp Met Thr His Lys lie Asp Ala Lys Met Thr Arg Glu Asp 

100 105 110 

Phe Glu Lys Glu lie Asn Ser Met Leu Pro Ser Arg Asn Leu Phe Tyr 
115 120 125 
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Ala He Arg lie £sp Gl'y Leu Phe'Lys 'Lys Val 'Gin Thr' Airg Thr Val 

130 ' ' 135 ' i , 140 ' 

' • i 

Glu Leu Gin Glu ,Lys Pro Tyr Val Pro Met' Val Glu Ala Val Lys Thr 
145 150 155 160 

Gin Pro He Phe Asn Phe Asp Asn Val Arg' Gly Thr lie Val Qly Phe 

165 * ' 170: " ■ ■' 1 ' , 175 

i * i ' ' i 

• i ■ 

Leu Thr Pro Ala Tyr Ala Asn Gly i lie Ala Vai Ser Gly Tyr His Leii 

1 1 ' 180 , ' 185 , 190. 

i 

His Phe He Asp Glu Gly Arg Asn Ser Gly Gly His Val Phe Asp Tyr 
195 200 • 205 

i 1 

i t 

Val Leu Glu Asp Cys Thr Val Thr lie Ser' Gin Lys Met Asn Met Asn 
210 215 ' , 220 

Leu Arg 1 Leu Pro • Asn Thr Ala Asp Phe Phe Asn Ala 1 Asn Leu Asp Asn 
225 230 235 240 

t 

Pro Asp Phe Ala Lys Asp He Glu Thr Thr Glu Gly Ser Pro Glu 

245 250 255 



<210> 70 
<211> 1254 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (1251) 

<400> 70 

atg aca ttc tec ctt ttt ggt gac aaa ttt acc cgc cac tec ggc att 48 
Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly He 
1*5 10 15 

acg ctg ttg atg gaa gat ctg aac gac ggt tta cgc acg cct ggc gcg 96 
Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala 

20 25 30 

att atg etc ggc ggc ggt aat ccg gcg cag ate ccg gaa atg cag gac 14 4 
He Met Leu Gly Gly Gly Asn Pro Ala Gin He Pro Glu Met Gin Asp 
35 40 45 

tac ttc cag acg eta ctg acc gac atg ctg gaa agt ggc aaa gcg act 192 
Tyr Phe Gin Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr 
50 55 60 

gat gca ctg tgt aac tac gac ggt cca cag ggg aaa acg gag eta etc 240 
Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gin Gly Lys Thr Glu Leu Leu 
65 70 75 80 

aca ctg ctt gec gga atg ctg cgc gag aag ttg ggt tgg gat ate gaa 288 
Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp He Glu 
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( , 85 , , 90 95 

i \ , * t * • • 

cca cag aat att' gca eta aca aac ggc age ckg age gcg ttt ttc tac 336 

Pro Gin Asn'Tle Ala Leu Thr Asn Gly Ser, Gin Ser Ala Phe Phe T^r 

100 i 105 ' 110 

tta ttt aac ctg ttt gee gga cgc cgt gec gat ggt egg gtc aaa aaa 38,4 
Leu Phe Asn Leu phe Ala Gly Arg Arg Ala ' Asp Gly Arg Val Lys Lys 
115 , 120 : " 12*5 ' , 

gt^g ctg f ttc ccg ctt gca ccg gai'tac att ggc tat get gac gee gga 4 32 
Val Leu Phe Pro Leu Ala Pro Glu Tyr 'lie Gly Tyr Ala Asp- Ala Gly 
130 i 135 ' ( 140 

i 

ctg gaa gaa gat ctg ttt gtc tct gcg cgt , ccg aat att gaa ctg ctg 480 
Leu Glu Glu Asp Leu Phe Val Ser A,la Arg Pro Asn lie Glu Leu Leu 
145 150 155 160 

ccg gaa ggc cag ttt aaa tac cac gtc gat ttt gag cat ctg cat 1 att 528 
Pro Giu 1 Gly Gin' Phe Lys Tyr His Val Asp Phe Glu His Leu His lie 

165 170 175 

ggc gaa gaa acc ggg atg att tgc gtc tec egg ccg acg aat cca aca 576 
Gly Glu Glu Thr Gly Met lie Cys Val Ser Arg Pro Thr Asn Pro Thr 

180 185 190 

i 

ggc aat gtg att act gac gaa gag ttg ctg aag ctt gac gcg ctg ggc 624 

Gly Asn Val lie Thr Asp Glu Glu Leu Leu Lys Leu Asp Ala Leu Gly 
195 200 205 

aat caa cac ggc att ccg ctg gtg att gat aac get tat ggc gtc ccg 672 
Asn Gin His Gly lie Pro Leu Val lie Asp Asn Ala Tyr Gly Val Pro 
210 215 220 

ttc ccg ggt ate ate ttc agt gaa gcg cgc ccg eta tgg aat ccg aat 720 
Phe Pro Gly lie lie Phe Ser Glu Ala Arg Pro Leu Trp Asn Pro Asn 
225 230 235 240 

ate gtg ctg tgc atg agt ctt tec aag ctg ggt eta cct ggc tec cgc 768 
lie Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg 

245 250 255 

t9 c 99 c a ^t ate ate gee aat gaa aaa ate ate acc gee ate acc aat 816 
Cys Gly lie lie lie Ala Asn Glu Lys lie lie Thr Ala lie Thr Asn 

260 265 270 

atg aac ggc att ate age ctg gca cct ggc ggt att ggt ccg gcg atg 864 
Met Asn Gly lie lie Ser Leu Ala Pro Gly Gly lie Gly Pro Ala Met 
275 280 285 

atg tgt gaa atg att aag cgt aac gat ctg ctg cgc ctg tct gaa aca 912 
Met Cys Glu Met lie Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr 
290 295 300 

gtc ate aaa ccg ttt tac tac cag cgt gtt cag gaa act ate gee ate 960 
Val lie Lys Pro Phe Tyr Tyr Gin Arg Val Gin Glu Thr lie Ala lie 
305 310 315 320 
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att cgc cgc tat 
lie Arg Arg Tyr 



tta 
Leu 
325 



ccg gaa aat cgc 
Pro Glu Asn Arg 



tgc ctg att cat aaa 
Cys Leu lie His Lys 
330 



ccg gaa 
Pro' Glu' 
335 , 



1008 



gga 
Gly 



gcc 
Ala 



att 
He 



ttc 
Phe 
340 



etc 
Leu 



tgg 
Trp 



eta 
Leu 



tgg 
Trp 



ttt 
Phe 
345 



aag gat ttg ccc 
Lys Asp Leu Pro 



att 
He 
350 



acg acc 
Thr Thr 



1056 



aag 
Lys 



cag 
Gin 



ggg cac 
Gly His 
370 



etc tat 
Leu Tyr 
355 



aac 
Asn 



ttc 
Phe 



cag cgc ctg aaa 
Gin Arg Leu; Lys 

360 



ttc cca 
Phe Pro 



ggg; 

Gly 

375; 



ctg 
Leu 



gca 
Ala 



gat 
Asp 



cgc 
Arg 



aaa 
Lys 



ggc. 
Gly 



ccg 
Pro 



gtg ctg 
Val Leu 
365 



tgg 
Trp 
380 



ccg 
Pro 



atg 
Met 



cat 
His 



gtg ccg 
Val Pro 



1104 



acg 
Thr 



cat 
His 



1152 



caa tgt atg cgc atg aac tac. gta cca gag ccg gag aaa att 
Gin Cys Met Arg Met Asn Tyr Val Pro Glu Pro Glu Lys He 
385 390 395 



gag gcg 
Glu Ala 
400 



1200 



ggg gtg aag att ctg gcg gaa gag ata gaa aga gcc tgg get gaa agt 
Gly Val Lys He Leu Ala GluiGlu He Glu Arg Ala Trp Ala Glu Ser 

405 i ! 410 415 



1248 



cac taa 
His 



1254 



<210> 71 
<211> 417 
<212> PRT 

<213> Escherichia coli 
<400> 71 

Met Thr Phe Ser Leu Phe Gly Asp Lys Phe Thr Arg His Ser Gly He 
15 10 15 

Thr Leu Leu Met Glu Asp Leu Asn Asp Gly Leu Arg Thr Pro Gly Ala 

20 25 30 

He Met Leu Gly Gly Gly Asn Pro Ala Gin He Pro Glu Met Gin Asp 
35 40 45 

Tyr Phe Gin Thr Leu Leu Thr Asp Met Leu Glu Ser Gly Lys Ala Thr 
50 55 60 

Asp Ala Leu Cys Asn Tyr Asp Gly Pro Gin Gly Lys Thr Glu Leu Leu 
65 70 75 80 

Thr Leu Leu Ala Gly Met Leu Arg Glu Lys Leu Gly Trp Asp He Glu 

85 90 95 

Pro Gin Asn He Ala Leu Thr Asn Gly Ser Gin Ser Ala Phe Phe Tyr 

100 105 110 

Leu Phe Asn Leu Phe Ala Gly Arg Arg Ala Asp Gly Arg Val Lys Lys 
115 120 125 
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Val Leu Phe Pro Leu Ala Pro Glu Tyr lie Gly Tyr Ala Asp Ala Gly 
i '130 ' 135 . ' 140 '. ' 

Leu Glu Glu Asp Leu Phe Val Ser Ala Arg Pro Asn lie Glu Leu Leu 
145 150 155 ' 160 

Pro Glu Gly Gin Phe Lys Tyr His Val Asp Phe Glu His Leu His lie 

165 * 170 ' 175 



Gly Glu Glu Thr Gly Met lie 

180 



Cys Val Ser Arg Pro Thr Asn Pro Thr ( 
185 ! 190 ' 



Gly Asn Val lie Thr Asp Glu' Glu Leu Leu Lys Leu Asp Ala Leu Gly 
195 : 200 205 



I.* 



I r 

Asn Gin His Gly lie Pro Leu: Val lie Asp Asn Ala Tyr Gly Val Pro 
210 215 220 



Phe Pro Gly lie lie Phe Ser : Glu Ala Arg Pro Leu Trp Asn Pro Asn 

225 230 1 235 240 

lie Val Leu Cys Met Ser Leu Ser Lys Leu Gly Leu Pro Gly Ser Arg 

245 ! 250 255 



Cys Gly lie lie lie Ala Asn Glu Lys lie lie Thr Ala lie Thr Asn 

260 265 270 

Met Asn Gly lie lie Ser Leu Ala Pro Gly Gly lie Gly Pro Ala ;Met 
275 280 285 

Met Cys Glu Met lie Lys Arg Asn Asp Leu Leu Arg Leu Ser Glu Thr 
290 295 300 

Val lie Lys Pro Phe Tyr Tyr Gin Arg Val Gin Glu Thr He Ala He 
305 310 315 320 

He Arg Arg Tyr Leu Pro Glu Asn Arg Cys Leu lie His Lys Pro Glu ' 

325 330 335 

Gly Ala He Phe Leu Trp Leu Trp Phe Lys Asp Leu Pro He Thr Thr 

340 345 350 

Lys Gin Leu Tyr Gin Arg Leu Lys Ala Arg Gly Val Leu Met Val Pro 
355 360 365 

Gly His Asn Phe Phe Pro Gly Leu Asp Lys Pro Trp Pro His Thr His 
370 375 380 

Gin Cys Met Arg Met Asn Tyr Val Pro Glu Pro Glu Lys lie Glu Ala 
385 390 395 400 

Gly Val Lys He Leu Ala Glu Glu He Glu Arg Ala Trp Ala Glu Ser 

405 410 . 415 



His 
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<210> 72 

, <211> 8803 ' I i ' ' .., ■ _ • 

<212> DNA 1 ' ' " - ' i , 

<213> Artificial Sequence , ' . ' 

i 

<220> 

<223> Description of Artificial Sequence: Recombinant , 
pAN294 plasmid ' ' ■ ' 



<4oo> 72 

tgcgccgcta cagggcgcgt ccattcgfcca ttcaggctgc gcaactgttg ggaagggcga 60 



tcggtgcggg cctpttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 

ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctgg atgaaaagcc 240 

• 1 

gatgaccgct tttcaggtct gtcagcagct ttttcctgct gtatatgaaa aggaattgtt 300 

'i ' , 

tttaacgatg tcagaaacgg caggtcacct tgatgtgttg gaggctgaag aagccatcac 360 
gtcatattgg gaaggaaata ccgtatactt taaaacaatg aagaggtgaa atgggtgaaa 420 

i 1 

catatagcgg gaaaaaggat ttggataacc ggcgcttcag gagggcttgg agaaagaatc , 480 
gcatacttat gcgcggctga aggagcccat gtcctgctgt cggctagacg cgaggatcgt 540 
ttgatagaaa tcaaaaggaa aataaccgag gaatggagcg gacagtgtga gatttttcct 600 
ctggatgtcg gccgcctaga ggatatcgcc cgggtccgcg atcagatcgg ctcgattgat 660 
gtactgatta acaatgcagg cttcggtata tttgaaacgg ttttagactc tacattggat 720 
gacatgaaag cgatgtttga tgtgaatgtc ttcggcctga tcgcctgtac aaaagcggtg 780 
cttccgcaaa tgcttgagca aaaaaaggga catatcatca atatcgcctc tcaagcgggg 840 
aaaatcgcca caccgaagtc tagcctgtat tccgcgacca aacatgccgt gttaggttac 900 
tcaaacgctt tgcggatgga gctttcggga accggcattt atgtgacaac agtcaacccg 960 
ggcccgattc agacggactt tttttccatt gctgataaag gcggggacta cgccaaaaat 1020 
gtcggccgct ggatgcttga tcctgatgac gtggcagctc aaattacagc tgcaattttt 1080 
acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggcactaa gctgtatcag 1140 
ctgtttccag ctcttgtaga aaagctggca ggacgcgcgc tcatgaaaaa ataatgatag 1200 
aactgcctgt ggtggagtgg cttgtttctc acggggcagt ttttgatagt ggaagggaga 1260 
gattgttgaa tgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320 
cagaaactga ttgatgcaga gcctgtattc tgggcaaatt caggcaagaa agaggggcca 1380 
ttaccccgtg cagatgagtg ggcaaccgag atagcggaag cggaaaaaag aatgcagcgg 1440 
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tttgcacctt acattgccga ggtgtttcct gagacgaaag gcgctaaagg aatcatcgag 1500 

i 1 i ■ • • 

tctccgcttt ttgaggtgca gcatatgaag ggaaagctgg aagcggcata tcagcagcca 1560 
tttcccggaa gatggctttt aaagtgcgac catgagcttc cgatttcagg atcgattaaa 1620 

i 

i 

gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct tcaagaagga 16B0 
atgcttcagg aaaccgatga ttaccgcatc ttacaggaag agcggtttac cgggtttttc 1740 
t9ccgctat f t cgattgctgt cggttcgacd ggaaatctag gtttaagcat cggcatcatc. 1800 
ggcgcggcac tcgggtttcg cgtgacagtg catatgtccg ccgatgctaa gcagtggaaa 1860 

i 

aaggatctcc tccgccaaaa gggagtcact gtta'tggagt acgaaacaga ttacagtgaa 1920 
gcggtgaacg aagggagacg gcaggcggaa caagatccat tctgttattt tattgatgat 1980 
gaacattctc gtcagctgtt cttaggatat gctgttgctg caagccgatt aaaaacacag 2040 
cttgactgta tgaatataaa qccaagtctt qaqacgccct tgtttgtgta tctgccgtgc ?100 
ggagtcggcg gaggaccggg cggtgtagca tttgggctga agcttttata cggagatgat 2160 
gttcatgtgt ttttcgcaga accaactcat tcaccttgta tgctgttagg gctttattca ' 2220 
ggacttcacg agaagatctc cgtccaggat atcggcctgg ataatcagac ggctgctgac 2280 
ggacttgccg tagggaggcc gtcaggattt gtcggcaagc tgattgaacc gcttctgagc 2340 

i ( 

ggctgttata cggtagagga caatacgctt tatactttgc ttcatatgct ggctgtatct 2400 
gaagataaat atttagagcc ctctyctctt gctggcatgt tcgggccggt tcagcttttt 2460 
tcgacagaag agggaaggcg ctatgctcag aaatataaga tggaacatgc cgtacatgtc 2520 
gtctggggaa cgggaggaag catggttcca aaagatgaaa tggctgcgta taaccgaatc 2580 
ggtgctgatt tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttttccgt 2640 
tacaatcttt gtaagggttt taacctacag agagtcaggt gtaaacagtg aaaaataaag 2700 
aacttaacct acatacttta tatacacagc acaatcggga gtcttggtct ggttt tgggg 2760 
ggcatttgtc gattgctgta tctgaagaag aggcaaaagc tgtggaagga ttgaatgatt 2820 
atctatctgt tgaagaagtg gagacgatct atattccgct tgttcgcttg cttcatttac 2880 
atgtcaagtc tgcggctgaa cgcaataagc atgtcaatgt ttttttgaag cacccacatt 2940 
cagccaaaat tccgtttatt atcggcattg ccggcagtgt cgcagtcgga aaaagcacga 3000 
cggcgcggat cttgcagaag ctgctttcgc gtttgcctga ccgtccaaaa gtgagcctta 3060 
tcacgacaga tggtttttta tttcctactg ccgagctgaa aaagaaaaat atgatgtcaa 3120 
gaaaaggatt tcctgaaagc tatgatgtaa aggcgctgct cgaatttttg aatgacttaa 3180 
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aatcctttaa aaaaacacaa aagaccacat tttttaatgt ggtctttatt cttcaactaa 3780 



i i 



aaaattgatt tagacaattg gaagagaaaa gagatattta atcattattt gaaccaacaa 4020 
acgactttta gtataaccac agaaattgat attagtgttt tataccgaaa cataaaacaa 4080 
gaaggatata aattttaccc tgcatttatt ttcttagtga caagggtgat aaactcaaat 4140 
acagctttta gaactggtta caatagcgac ggagagttag gttattggga taagttagag 4200 
ccactttata caatttttga tggtgtatct aaaacattct ctggtatttg gactcctgta 4260 
aagaatgact tcaaaga'gtt ttatgattta tacctttctg atgtagagaa atataatggt 4320 
tcggggaaat tgtttcccaa aacacctata cctgaaaatg ctttttctct ttctattatt 4380 
ccatggactt catttactgg gtttaactta aatatcaata ataatagtaa ttaccttcta 4440 
cccattatta cagcaggaaa attcattaat aaaggtaatt caatatattt accgctatct 4500 
ttacaggtac atcattctgt ttgtgatggt tatcatgcag gattgtttat gaactctatt 4560 
caggaattgt cagataggcc taatgactgg cttttataat atgagataat gccgactgta 4620 
ctttttacag tcggttttct aatgtcacta acctgccccg ttagttgaag aaggttttta 4680 
tattacagct gtcgactcgt gatcttcgga caggctgttc agctttttct caatgcgatc 4740 
cagctgcgct tttcggtttt tcgcatactt gaagcctgta acagccgcaa agacgacagc 4800 
ggcaaatata ataaatacaa acagctgaaa catcacatca cctatattca tgttcttcac 4860 
ctcatgtttg cgggagagat tcattctctt ccgtttttta tttaaagcgg cttttccaga 4920 



gaccgcgagg 

• , s ' 


i ' 


i 

atta^tgttc 


jjUU 


i » 
tccgatttct 


3360 


ta ttt agagc 




cataaattta 


o /ton 
J4 o U 


agtgtcaacc 


3540 


attttgcgta 


3600 


gtgctigcagc 


3660 


gctacgctca 


3720 


cttcaactaa 


3780 


aaatatatat 


3840 


aagcagacaa 


3900 


gaactttaat 


3960 


gaaccaacaa 


4020 


cataaaacaa 


4080 


aaactcaaat 


4140 


taagttagag 


4200 


gactcctgta 


4 260 


atataatggt 


4320 


ttctattatt 


A *5 O f\ 

4380 


ttaccttcta 


4 4 40 


accgct.a t ct 


a Q nn 
3 DUU 


g a a c t c r. a t. c 


*1 D OU 


gccgactgua 




aaggttttta 


4680 


caatgcgatc 


4740 


agacgacagc 


4800 


tgttcttcac 


4860 


cttttccaga 


4920 
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cgggaacggt gt,tttgtggt ctccattttc a.ttfgccgat aggcgaacgc taaaaatggc 4980 

aggccgagca gggtaatgcc gctcaggaca gaaaaaatat . aaatcggccg gccagcgcca 5040 

' ' 1 I ' 

aacaggtcta tacafatccc cccgacccaa gggccgatga cgtttccgag ctgtggaaaa 5100 

j 

ccgattgccc cgaaataagt gccttttaat cctggttttg caatctggtc tacatacaaa 51,60 

tccatcatag agaataaaag cacttcgccg attgtaaatg tgatgacaat catcacaatt 5220 

i ■ . 

ga.tggaacap cgtgtgatac ggtgaaaatcf gccatgctga tgctaaccat cacattaccg 5280 

1 i 

agcatcagag aacaaagcgg cgaaaaccgt tttgcaaaat ggacaatggg aaattgcgtc 5340 

gccaacacaa cgattgcgtt taatgtcagc atcagcccat acagcttcgt tccattgccg 5400 

atcaaggggt tctgcgccat atactgaggcj aatgtggaac tgaattgtga gtagccgaag 5460 

gtgcatacfcg taatgccgac caaagcaatg gtaaaaagat aatccttttg cgtgaccata 5520 

aacgcttccc gcacgctcat atttcgggac tgggctggtg ctgataagga tqqatgtttt 5580 

ttaaattgga gggcaagcac aattccgtat agtccgtaaa tgactgcagg caccaaaaag 5640 

ggcgtagtcg attgcgatga gccgaaatat aggccaagca caggtccgaa gacaacgccg '5700 

atattaatag ccgcatagcg taaattaaaa actagcagtc tcgttttttc ttctgtcata 5760 

tcagacaaca aggcctttga agcgggctca aacagtgatt tgcaaagacc gtttaatgcg 5820 

tttactacaa aaaacaccca gagattagat gctgccgcaa agcctgcaaa taccagcatc 5880 

catccgaaaa tcgatacaag catcatgttt tttctgccga atttatctga gatatatccg 5940 

ccgtaaaagc ttgcgaggat gccgactgat gagctcgcgg cgatgaccag ccctgcatag 6000 

gaagctgatg cgccttggac ggctgtcaaa taaatcgcta aaaaaggaat gctcatcgat 6060 

gttgccattc tgccgaaaat ggttccgatt ataattgtac gcgttggatg catagcttga 6120 

gtattctata gtgtcaccta aatagcttgg cgtaatcatg gtcatagctg tttcctgtgt 6180 

gaaattgtta tccgctcaca attccacaca acatacgagc cggaagcata aagtgtaaag 6240 

cctggggtgc ctaatgagtg agctaactca cattaattgc gttgcgctca ctgcccgctt 6300 

tccagtcggg aaacctgtcg tgccagctgc attaatgaat cggccaacgc gcggggagag 6360 

gcggtttgcg tattgggcgc tcttccgctt cctcgctcac tgactcgctg cgctcggtcg 6420 

ttcggctgcg gcgagcggta tcagctcact caaaggcggt aatacggtta tccacagaat 6480 

caggggataa cgcaggaaag aacatgtgag caaaaggcca gcaaaaggcc aggaaccgta 6540 

aaaaggccgc gttgctggcg tttttcgata ggctccgccc ccctgacgag catcacaaaa 6600 

atcgacgctc aagtcagagg tggcgaaacc cgacaggact ataaagatac caggcgtttc 6660 
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aaggatctca agaagatcct ttgatctttt ctacggggtc tgacgctcag tggaacgaaa 7200 



gttaccaatg cttaatcagt gaggcaccta tctcagcgat ctgtctattt cgttcatcca 7380 

tagttgcctg actccccgtc gtgtagataa ctacgatacg ggagggctta ccatctggcc 7440 

ccagtgctgc aatgataccg cgagacccac gctcaccggc tccagattta ucagcaataa 7500 

accagccagc cggaagggcc gagcgcagaa gtggtcctgc aactttatcc gcctccatcc 7560 

agtctattaa' ttgttgccgg gaagctagag taagtagttc gccagttaat agtttgcgca 7620 

acgttgttgg cattgctaca ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat 7680 

tcagctccgg ttcccaacga tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag 7740 

cggttagctc cttcggtcct ccgatcgttg tcagaagtaa gttggccgca gtgttatcac 7800 

tcatggttat ggcagcactg cataattctc ttactgtcat gccatccgta agatgctttt 7860 

ctgtgactgg tgagtactca accaagtcat tctgagaata ccgcgcccgg cgaccgagtt 7920 

gctcttgccc ggcgtcaata cgggataata gtgtatgaca tagcagaact ttaaaagtgc 7980 

tcatcattgg aaaacgttct tcggggcgaa aactctcaag gatcttaccg ctgttgagat 8040 

ccagttcgat gtaacccact cgtgcaccca actgatcttc agcatctttt actttcacca 8100 

gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc aaaaaaggga ataagggcga 8160 

cacggaaatg ttgaatactc atactcttcc tttttcaata ttattgaagc atttatcagg 8220 

gttattgtct catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg 8280 

ttccgcgcac atttccccga aaagtgccac ctgtatgcgg tgtgaaatac cgcacagatg 8340 

* 

cgtaaggaga aaataccgca tcaggcgaaa ttgtaaacgt taatattttg ttaaaattcg 8400 



ggatacctgt 


6720 

i ■ 


i 

aggtatctca 


6780 


gttcagcccg 


6840 


cacgacttat 


6900 


ggcggtgcta 


6960 


1 i 

i 

tttggtatct 


7020 


tccggcaaac 


7080 


i 

cgcagaaaaa 


7140 


tggaacgaaa 


7200 


tagatccttt 


7260 


tggtctgaca 


7320 


cgttcatcca 


7380 


ccatctggcc 


7440 


ucagcaataa 


7500 


gcctccatcc 


7560 


agtttgcgca 


7620 


atggcttcat 


7680 


tgcaaaaaag 


7740 


gtgttatcac 


7800 


agatgctttt 


7860 


cgaccgagtt 


7920 


ttaaaagtgc 


7980 


ctgttgagat 


8040 


actttcacca 


8100 


ataagggcga 


8160 


atttatcagg 


8220 


caaatagggg 


8280 


cgcacagatg 


8340 


ttaaaattcg 


8400 
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cgttaaatat ttgttaaatc agctcatttt 

i 

cttataaatc aaaagaatag accgagatag 

i * 

i i 

gtccactatt aaagaacgtg gactccaacg 

atggcccact acgtgaacca tcacccaaat 

i 

ctctaaatcg gaaccctaaa gggagccccc 

acgtggcgag aaaggaaggg aagaaagcga 

i i 

• tagcggtcac gctgcgcgta accaccacac 

i 

1 t 
<210> 73 i 
<211> 8320 
<212> DNA 

<213> Artificial Sequence , 

<220> j 
<223> Description of Artificial 
pAN296 plasmid ! 



<400> 73 
tgcgccgcta 


cagggcgcgt 


ccattcgcca 


tcggtgcggg 


cctcttcgct 


attacgccag 


ttaagttggg 


taacgccagg 


gttttcccag 


ttgtaatacg 


actcactata 


gggcgaattg 


gatgaccgct 


tttcaggtct 


gt cagcagct 


tttaacgatg 


tcagaaacgg 


caggtcacct 


gt catattgg 


gaaggaaata 


ccgtatactt 


catatagcgg 


gaaaaaggat 


ttggataacc 


gcatacttat 


gcgcggctga 


aggagcccat 


ttgatagaaa 


tcaaaaggaa 


aataaccgag 


ctggatgtcg 


gccgcctaga 


ggatatcgcc 


gtactgatta 


acaatgcagg 


cttcggtata 


gacatgaaag 


cgatgtttga 


tgtgaatgtc 


cttccgcaaa 


tgcttgagca 


aaaaaaggga 


aaaatcgcca 


caccgaagtc 


t agcctgtat 


tcaaacgctt 


tgcggatgga 


gctttcggga 


ggcccgattc 


agacggactt 


tttttccatt 
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ttaaccaata 


ggccgaaatc 


qgcaaaatcc 

> . • • ' ■ . -< 


8460 

i 


ggttgagtgt 


i 

tgttccagtt 


i 

tggaacaaga 

i 


8520 


tcaaagggcg aaaaaccgtc 


tatcagggcg 


8580 


caagtttttt 


gcggtcgagg 


tgccgtaaag 


8640 


gatttagagc 

i j 


ttgacgggga 


aagccggcga 

* 


8700 


aaggagcggg 


cgctagggcg 


ctggcaagtg 


8760 


ccgccgcgct 


taa 




8803 



equence : Recombinant 



ttcaggctgc 


gcaactgttg 


ggaagggcga 


60 


ctggcgaaag 


ggggatgtgc 


tgcaaggcga 


120 


t cacgacgtt 


gtaaaacgac 


ggccagtgaa 


180 


ggcccgacgt 


cgcatgctgg 


atgaaaagcc 


240 


ttttcctgct 


gtatatgaaa 


aggaattgtt 


300 


tgatgtgttg 


gaggctgaag 


aagccatcac 


360 


taaaacaatg 


aagaggtgaa 


atgggtgaaa 


420 


ggcgcttcag 


gagggct tgg 


agaaagaatc 


480 


gtcctgctgt 


cggctagacg 


cgaggatcgt 


540 


gaatggagcg 


gacagtgtga 


gatttttcct 


600 


cgggtccgcg 


atcagatcgg 


ctcgattgat 


660 


tttgaaacgg 


ttttagactc 


tacattggat 


720 


ttcggcctga 


tcgcctgtac 


aaaagcggtg 


780 


catatcatca 


atatcgcctc 


tcaagcgggg 


840 


tccgcgacca 


aacatgccgt 


gttaggttac 


900 


accggcattt 


atgtgacaac 


agtcaacccg 


960 


gctgataaag 


gcggggacta 


cgccaaaaat 


1020 
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gtcggccgct ggaigcttga tcctgatgac gtggcagctc 'aaattacagc tgcaattttt 1080 

acgaaaaagc gggagatcaa tcttccgcgt ttaatgaatg ccggca'ctaa gctgtatcag 1140 

i ■ 
ctgtttccag ctcttgtaga aaagctggca ggacgcgcgc tcatgaaaaa ataatgatag 1200 

i 

aactgcctgt ggtggagtgg cttgtttctc f acggggcagt ttttgatagt ggaagggaga 1260 
gattgttgaa tgtcagttca ttcagaagtc cttcatgctc tgcttaaaga tccgtttatt 1320 

cagaaactga ttgatgcaga gcctgtattc t'gggcaaatt caggcaagaa agaggggcca 1380 

i 

ttaccccgtg cagatgagtg ggcaaccgag atagcggaag cgga'aaaaag aatgcagcgg 1440 

\ 

tttgcecctt acattgccga ggtgtttcct gagacgaaag gcgctaaagg aatcatcgag 1500 

i * 

tctccgcttt ttgaggtgca gcatatgaag ggaaagctgg aagcggcata tcagcagcca 1560 

i 

tttcccggaa gatggctttt aaagtgcgac catgagcttc cgatttcagg atcgattaaa 1620 
gcgaggggcg ggatttatga agtgttaaag tatgctgaaa atctcgcgct tcaagaagga 1680 
atgcttcagg aaaccgatga ttaccgcatc ttacaggaag agcggtttac cgggtttttc 1740 
tcccgctatt cgattgctgt cggttcgaca. ggaaatctag gtttaagcat cggcatcatc 1800 
ggcgcggcac tcgggtttcg cgtgacagtg catatgtccg ccgatgctaa gcagtggaaa 1860 
aaggatctcc tccgccaaaa gggagtcact gttatggagt acgaaacaga ttacagtgaa 1920 
gcggtgaacg aagggagacg gcaggcggaa caagatccat tctgttattt tattgatgat 1980 
gaacattctc gtcagctgtt cttaggatat gctgttgctg caagccgatt aaaaacacag 2040 
cttgactgta tgaatataaa gccaagtctt gagacgccct tgtttgtgta tctgccgtgc 2100 
ggagtcggcg gaggaccggg cggtgtagca tttgggctga agcttttata cggagatgat 2160 
gttcatgtgt ttttcgcaga accaactcat tcaccttgta tgctgttagg gctttattca 2220 
ggacttcacg agaagatctc cgtccaggat atcggcctgg ataatcagac ggctgctgac 2280 
ggacttgccg tagggaggcc gtcaggattt gtcggcaagc tgattgaacc gcttctgagc 2340 
ggctgttata cggtagagga caatacgctt tatactttgc ttcatatgct ggctgtatct 2400 
gaagataaat atttagagcc ctctgctctt gctggcatgt tcgggccggt tcagcttttt 2460 
tcgacagaag agggaaggcg ctatgctcag aaatataaga tggaacatgc cgtacatgtc 2520 
gtctggggaa cgggaggaag catggttcca aaagatgaaa tggctgcgta taaccgaatc 2580 
ggtgctgatt tgctaaaaaa acgaaatgga aaataagcag acagtgaaaa ggttttccgt 2640 
tacaatcttt gtaagggttt taacctacag agagtcaggt gtaaacagtg aaaaataaag 2700 
aacttaacct acatacttta tatacacagc acaatcggga gtcttctgca gctcgagcaa 2760 
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tagttaccct tattatcaag ataagaaaga. aaaggatttt tcgctacgct caaatccttt 2820 

1 ' 11 i r 

aaaaaaacac aaaagaccac attttttaat gtggtcttta ttcbtcaact aaagcaccca 2880 

ttagttcaac aaacgaaaat tggataaagt gggatatttt taaaatatat atttatgtta 2940 

cagtaatatt gacttttaaa aaaggattga ( ttctaatgaa gaaagcagac aagtaagcct 3000 



cctaaattca ctttagataa aaatttagga ggcatatcaa atgaacttta ataaaattga 3060 

1 1 1 i ■, ■ 1 

i -, ■ 1 

tttagacaat tggaagagaa aagagatatt taatcattat ttgaaccaac aaacgacttt 3120 

i 

tagtataacc acagaaattg atattag-cgt ' tttataccga aacataaaac aagaaggata 3180 

taaattttac cctgcattta ttttcttagt gacaagggtg ataaactcaa atacagcttt 3240 

i * 

tagaactggt tacaatagcg acggagagtt aggttattgg gataagttag agccacttta 3300 
tacaattttt gatggtgtat ctaaaacatt ctctggtatt tggactcctg taaagaatga 3360 

> 

cttcaaagag ttttatgatt tatacctttc tgatgtagag aaatataatg gttcggggaa 3420 

i 

attgtttccc aaaacaccta tacctgaaaa tgctttttct ctttctatta ttccatggac 3480 
ttcatttact gggtttaact taaatatcaa taataatagt aattaccttc tacccattat 3540 

i 

tacagcagga aaattcatta ataaaggtaa ttcaatatat ttaccgctat ctttacaggt 3600 

acatcattct gtttgtgatg gttatcatgc aggattgttt atgaactcta ttcaggaatt 3660 

gtcagatagg cctaatgact ggcttttata atatgagata atgccgactg tactttttac 3720 

agtcggtttt ctaatgtcac taacctgccc cgttagttga agaaggtttt tatattacag 3780 

ctgtcgacta aggtcgagga agtgttggta aggagggtat gaaatgtgca tcatattgaa 3840 

ctgtatgtct ctgatttgga ggcgtctagg cggttttggg gctggttctt aaaagaactt 3900 

ggttataaag agtatcaaaa atggagctca ggcatcagct ggaagaaaga tcgtttttac 3960 

ctagtgattg tgcaggcgaa agagccattt ctagagccgg aataccatag atgccgagtc 4020 

ggtctgaacc atctcgcatt tcatgctgaa tccaagcttc aagtcgatca gatgactgaa 4080 

aaattgacgg caaaaggcta tcgtgtgttg taccgagaca ggcatccttt tgccggagga 4140 

gacgggcatt atgcagtctt ttgtgaggat ccagaccgga ttaaggtaga gctcgttgcc 4200 

ccaagctgtt aatcgtgatc ttcggacagg ctgttcagct ttttctcaat gcgatccagc 4260 

tgcgcttttc ggtttttcgc atacttgaag cctgtaacag ccgcaaagac gacagcggca 4320 

aatataataa atacaaacag ctgaaacatc acatcaccta tattcatgtt cttcacctca 4380 

tgtttgcggg agagattcat tctcttccgt tttttattta aagcggcttt tccagacggg 4440 

aacggtgttt tgtggtctcc attttcattt gccgataggc gaacgctaaa aatggcaggc 4500 
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cgagcagggt aatgccgctc aggacagaaa aaatataaat, cggccggcca gcgcbaaaca 4560' 

ggtctataca tatccccccg acccaagggc cgatgacgtt tccgagctgt ggaaaaccga 4620 

i _ ) ' 

* i 

i * 

ttgccccgaa ataagtgcct tttaatcctg gttttgcaat ctggtctaca tacaaatcca 4680 
tcatagagaa taaaagcact tcgccgattg .taaatgt'gat gacaatcatc acaattgatg 4740 

■ * 
i 

gaacaccgtg tgatacggtg aaaatggcca tgctgatgct aaccatcaca ttaccgagca 4800 
tcagagaaca aagcggcgaa aaccgttttg caaaatgga'c aatgggaaat tgcgtcgcca 4860 

i ■ ■ ' 

i 

acacaacgat tgcgtttaat gtcagcatca gcccatacag cttcgttcca ttgccgatca 4920 

i ' 1 '■' ■ 

aggggttctg cgccatatac tgagggaatg tggaactgaa ttgtgagtag ccgaaggtgc 4980 

atagcgtaat gccgaccaaa gcaatggtaa aaagataatc cttttgcgtg accataaacg 5040 

cttcccgcac gctcatattt cgggactggg ctggtgctga taaggatgga tgttttttaa 5100 * 

attggagggc aagcacaatt ccgtatagtc cgtaaatgac tgcaggcacc aaaaagggcg 5160 

tagtcgattg cgatgagccg aaatataggc caagcacagg tccgaagaca acgccgatiat 5220 

taatagccgc atagcgtaaa ttaaaaacta gcagtctcgt tttttcttct gtcatatcag 5280 

acaacaaggc ctttgaagcg ggctcaaaca gtgat.ttgca aagaccgttt aatgcgttta 5340 

ctacaaaaaa cacccagaga ttagatgctg ccgcaaagcc tgcaaatacc agcatccatc 5400 

cgaaaatcga tacaagcatc atgttttttc tgccgaattt atctgagata tatccgccgt 5460 

aaaagcttgc gaggatgccg actgatgagc tcgcggcgat gaccagccct gcataggaag 5520 

ctgatgcgcc ttggacggct gtcaaataaa tcgctaaaaa aggaatgctc atcgatgttg 5580 

ccattctgcc gaaaatggtt ccgattataa ttgtaacgcg ttggatgcat agcttgagta 5640 

ttctatagtg tcacctaaat agcttggcgt aatcatggtc atagctgttt cctgtgtgaa 5700 

attgttatcc gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct 5760 

ggggtgccta atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc 5820 

agtcgggaaa cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg 5880 

gtttgcgtat tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc 5940 

ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag 6000 

gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa 6060 

aggccgcgtt gctggcgttt ttcgataggc tccgcccccc tgacgagcat cacaaaaatc 6120 

gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc 6180 

ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg 6240 
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cctttctccc 

1 1 


ttcgggaagc 


gtggcgcttt 


cggtgtaggt 


cgttcgctcc 


aagctgggc.t- 


gctgcgcctt 


atccggtaac 


tatcgtcttg 

* 


cactggcagc 


agccactggt 


aacaggatta 


agttcttgaa 


gtggtggcct 


aac.tacggct 


ctctgctgaa 


gccagttacc 


ttcggaaaaa 


ccaccgctgg 


tagcggtggt 

* 


ttttttgttt 

i 


gatctcaaga 


agatcctttg 


i 

atcttttcta 


cacgttaagg 


gattttggtc 


atgagattat 

. i • 


a fcaaaaatg 


aagttttaaa 


tc'aatctaaa 


accaatgctt 


aatcagtgag 


i 
i 

gcacctat^ct 


ttgcctgact 


ccccgtcgtg 


tagataacta 


gtgctgcaat 


gataccgcga 


gacccacgct 


agccagccgg 


aagggccyag 


cgcagaagtg 


ctattaattg 


ttgccgggaa 


gctagagtaa 


ttgttggcat 


tgctacaggc 


atcgtggtgt 


gctccggtt c 


ccaacgatca 


aggcgagtta 


tt agctcctt 


cggtcctccg 


atcgttgtca 


tggttatggc 


agcactgcat 


aattctctta 


tgactggtga 


gtactcaacc 


aagtcattct 


cttgcccggc 


gtcaatacgg 


gataatagtg 


tcattggaaa 


acgttcttcg 


gggcgaaaac 


gttcgatgta 


acccactcgt 


gcacccaact 


tttctgggtg 


agcaaaaaca 


ggaaggcaaa 


ggaaatgttg 


aatactcata 


ctcttccttt 


attgtctcat 


gagcggatac 


atatttgaat 


cgcgcacatt 


tccccgaaaa 


gtgccacctg 


aaggagaaaa 


taccgcatca 


ggcgaaattg 


taaatatttg 


ttaaatcagc 


tcatttttta 
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ctcatagctc acgctigtagg 


tatctcagtti 

i 


63t)0 


i , 

gtgtgcacga 


acqccccgtt 


cagcccg'acc 


6360 


agtccaaccc 


ggtaagacac 


gacttatcgc 


6420 


gcagagcgag 


gtatgtaggc 


ggtgctacag 

i 


6480 


acactagaag 


gacagtattt 


ggtatct»gpg 


6540 


gagt tggtag 


i 

ctcttgatcc 


ggcaaacaaa 


6600 


gcaagcagca 


gattacgcgc 


agaaaaaaag 

t i.' • 


6660 


cggggtctga 


cgctcagtgg 

t 


aacgaaaact 


6720 


caaaaaggat 


i 

cttcacctag 


atccttttaa 


6780 


gtatatatga 


gtaaacttgg 


tctgacagtt 


6840 


cagcgatctg 


tctatttcgt 


tcatccatag 


6900 


cgatacggga 


gggcttacca' 


tctggcccca 


6960 


caccggctcc 


agatttatca 


gcaataaacc 


7020 


gtcctgcaac 


tt tatccgcc 


tccatccagt 


7080 


gtagttcgcc 


agttaatagt 


ttgcgcaacg 


7140 


cacgctcgtc 


gtttggtatg 


gcttcattca 


7200 


catgatcccc 


catgttgtgc 


aaaaaagcgg 


7260 


gaagtaagtt 


ggccgcagtg 


ttatcactca 


7320 


ctgtcatgcc 


atccgtaaga 


tgcttttctg 


7380 


gagaataccg 


cgcccggcga 


ccgagttgct 


7440 


tatgacatag 


cagaacttta 


aaagtgctca 


7500 


tctcaaggat 


cttaccgctg 


ttgagatcca 


7560 


gat cttcagc 


atctt ttact 


ttcaccagcg 


7620 


atgccgcaaa 


aaagggaata 


agggcgacac 


7680 


ttcaatatta 


ttgaagcatt 


tatcagggtt 


7740 


gtatttagaa 


aaataaacaa 


ataggggttc 


7800 


tatgcggtgt 


gaaataccgc 


acagatgcgt 


7860 


taaacgttaa 


tattttgtta 


aaattcgcgt 


7920 


accaataggc 


cgaaatcggc 


aaaatccctt 


7980 
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ataaatcaaa agaatagacc gagatagggt tgagtgttgt itccagtttgg aacaagagtc 8040 

■ ■ ■ • • • , , 

cactattaaa gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg 8100 

1 i 1 . ■ i 'i 

i 

i • ■ 

gcccactacg tgaaccatca cccaaatcaa gttttttgcg gtcgaggtgc cgtaaagctc 8160 

i 

■ ^ i 

taaatcggaa ccctaaaggg agcccccgat ttagagcttg acggggaaagf ccggcgaacg 8220 

tggcgagaaa ggaagggaag aaagcgaaag gagcgggcgc tagggcgct'g gcaagtgtag 8280 

\ * i • ■ ■ ' '■ 

cggtcacgct gcgcgtaacc accacacccg ccgcgcttaa 8320 

I 

<210> 74 ( ' - 

<211> 250 

<212> PRT 1 
<213> Clostridium acetobutylicum 

<400>.7,4 i 

Asn Lys Arg Ala Ala Phe Met Leu Leu Leu Phe Leu Arg Ser Val Leu 

1 5 10 15 * 

i t i 

Lys Val lie Leu Val Leu Asp Val Gly Asn Thr Asn lie Val Leu Gly 

20 25 30 

lie Tyr Asn Asp Thr Lys Leu Thr Ala Glu Trp Arg Leu Ser Thr Asp 
35 40 45 

Val Leu Arg Ser Ala Asp Glu Tyr Gly lie Gin Val Met Asn Leu Phe 
50 55 60 

Gin Gin Asp Lys Leu Asp Pro Thr Leu Val Glu Gly Val lie lie Ser 
65 70 75 80 

Ser Val Val Pro Asn lie Met Tyr Ser Leu Glu His Met lie Arg Lys 

85 90 95 

Tyr Phe Lys lie Asn Pro Leu Val Val Gly Pro Gly lie Lys Thr Gly 

100 105 110 

lie Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie 
115 120 125 

Val Asn Ala Val Ala Ala His Glu lie Tyr Lys Arg Ser Leu lie lie 
130 135 140 

lie Asp Phe Gly Thr Ala Thr Thr Phe Cys Ala Val Arg Glu Asn Gly 
145 150 155 160 

Asp Tyr Leu Gly Gly Ala lie Cys Pro Gly lie Lys Val Ser Ser Glu 

165 170 175 

Ala Leu Phe Glu Lys Ala Ala Lys Leu Pro Arg Val Glu Leu lie Lys 

180 185 190 

Pro Ala Tyr Ala lie Cys Lys Asn Thr lie Ser Ser lie Gin Ser Gly 
195 200 205 
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Ile Val Tyr Arg' Tyr Leu Arg Gin Val Lys Tyr Leu Phe. Glu Lys' Leu 
210 215 220- 



Lys Glu Asn Leu Pro Asp Gly Arg Arg Thr Arg Thr Ser Leu Val Leu 

225 230 235 240 

i 

* • 
Ala Thr Gly Gly Leu Ala Lys . Leu lie Asn ( ■ 

245 250 



<210>- 75 
<211> 258 j 
<212> PRT 

<213> Rhodobacter capsulatus 1 1 



<400> 75 . 

Met Leu Leu Cys lie Asp Cys Gly Asn Thr Asn Thr Val Phe Ser Val 

1 5 ! * 10 15 

i , 1 

Trp Asp Gly Thr Asp Phe Ala ; Ala Thr Trp Arg lie Ala Thr Asp His 



20 



25 



30 



Arg Arg. Thr Ala Asp Glu Tyr : Phe Val Trp Leu Asn Thr Leu Met Gin 
35 . 40 45 

Leu Lys Gly Leu Gin Gly Arg lie Ser Glu Ala lie lie Ser Ser Thr 
50 55 60 

Ala Pro Arg Val Val Phe Asn Leu Arg Val Leu Cys Asn Arg Tyr Phe 
65 70 75 80 

Asp Cys Arg Pro Tyr Val Val Gly Lys Pro Gly Cys Glu Leu Pro Val 

85 90 95 

Ala Pro Arg Val Asp Pro Gly Thr Thr Val Gly Pro Asp Arg Leu Val 

100 105 110 

Asn Thr Val Ala Gly Tyr Asp Arg His Gly Gly Asp Leu lie Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Asp Val Val Ala Pro Asp Gly Ala 
130 135 140 

Tyr lie Gly Gly Val lie Ala Pro Gly Val Asn Leu Ser Leu Glu Ala 
145 150 155 ' 160 

Leu His Met Ala Ala Ala Ala Leu Pro His Val Asp Val Thr Lys Pro 

165 170 175 

Gin Gly Val He Gly Thr Asn Thr Val Ala Cys He Gin Ser Gly Val 

180 185 190 

Tyr Trp Gly Tyr He Gly Leu Val Glu Gly He Val Arg Gin He Arg 
195 200 205 



Met Glu Arg Asp Arg Pro Met Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 220 
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Ser Leu Phe As$ l^eu Gly Phe Asp Leu Phe Asp Lys ValGlu Asp Asp' 

225 . 230' » - 235 ( 240 

Leu Thr Met His Gly Leu Arg Leu lie Phe Asp Tyr Asn Lys Gly Leu 

245 250 255 

Gly Ala i < ' . . 



<210> 76 1 ' 
<211> 10801 

<212> DNA , ' ■ 
<213> Artificial Sequence ( 

<220> : , , 1 

<223> Description of Artificial Sequence: Recombinant 
pAN240 plasmid . 



<400> 76 




gaattttgcg 


gccgcttcga 


ttagtgacat 


tagaaaaccg 


gccagt catt 


aggcctatct 


aaccatcaca 


aacagaatga 


tattaatgaa 


ttttcctgct 


ttaagttaaa 


cccagtaaat 


gtataggtgt 


tttgggaaac 


ataaatcata 


aaactctttg 


tagatacacc 


at caaaaatt 


cgtcgctatt 


gtaaccagtt 


aaat aaatgc 


agggtaaaat 


tatcaatttc 


tgtggttata 


cttttctctt 


ccaattgtct 


tttttatcta 


aagtgaattt 


ttttttaaaa 


gtcaatatta 


ccaattttcg 


tttgttgaac 


aatgtggtct 


tttgtgtttt 


tatcttgata 


ataagggtaa 


ggccatccgt 


caggatggcc 



tcttcaacta 


acggggcagg 


60 


cattatctca 


tattataaaa 


120 


cataaacaat 


cctgcatgat 


180 


taaatatatt 


gaattacctt 


240 


at tactatta 


ttattgatat 


300 


aagagaaaaa 


gcattttcag 


360 


tttctct aca 

* 


tcagaaaggt 


420 


ccaaatacca 


gagaatgttt 


480 


atcccaataa 


cctaactctc 


540 


tatcaccctt 


gtcactaaga 


600 


gtttcggtat 


aaaacactaa 


660 


caaataatga 


ttaaatatct 


720 


catttgatat 


gcctcctaaa 


780 


ttcttcatta 


gaatcaatcc 


840 


ttaaaaatat 


cccactttat 


900 


gaataaagac 


cacattaaaa 


960 


cgaaaaatcc 


ttttctttct 


1020 


agtttgtaga 


aacgcaaaaa 


1080 


ggcagtttat 


ggcgggcgtc 


1140 
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ctgcccgcca ccctccgggc cgttgcttcq caacgttcaa atccgc^ccc ggcggatttg 1200 

1 . 

< i ' i r 

tcctactcag gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctt'tcg 1260 

j 1 

actgagcctt tcgttttatt tgatgcctgg cagttcccta ctctcgcatg gggagacccc 1320 

i « i 

acactaccat cggcgctacg gcgtttcact itctgagttcg gcatggggtc aggtgggacc 1380 

accgcgctac tgccgccagg caaattctgt tttatcagac cgcttctgcg ttctgattta 1440 

1 1 f i ■ , . ■ 

i 'i < i 1 , , 

atctgtatca ggctgaaaat cttctctcat ccgccaaaac aggatcctac ggaaatggag 1500 

cggcaaaacc gttttactct caaaatctta aaagaaaacc cccgataaag ggggcttttc 1560 

ttctacaaaa ttgtacgggc tggttcgttc cccagcattt gttcaatttt gttttgatca 1620 

i 

! 

ttcagaacag ccacttt'cgg ctcatggctt gccgcttctt gatcagacat cattttgtag 1680 

* 

gaaataataa tgaccttatc tccttcctgc acaaggcgtg cggctgcacc gtttaagcat 1740 

atgacgccgc ttccccgttt accaggaata atatacgttt caagacgtgc tccattatta 1800 

i 

ttattcacaa tttgtacttt ttcattagga agcattccca cagcatcaat gagatcttca 1860 

tcaattgtaa tgcttcccac atagttcagg tttgcttccg taacagttgc cctgtgaagt 1920 t 

ttgccgctca tcattgttcg atacatatta tattctctcc atttctcgaa tatcaataat 1930 

gatattatct attaaacgcg cttttgaaaa agcaactgca acagcgagaa tcatctttcc 2040 

agcaatttca ttcacaggct cgagttccgg ataggaataa agctctacat agtctatggt 2100 

tccgctagtc gtttcaatga tatcttttgc agcttttatc accgcttcag gatctctttc 2160 

accggcttgg acaagttccg cacttgtttg aagggcccga tacagcttag gcgcttcttt 2220 

tctttcctca gctgttaagt atacattgcg agagcttttg gctaagccgt cttcctctct 2280 

gaccgtatcg acaggaacca attcaatatc catgaagaag tcgctgatta acccatcaac 2340 

aacagctacc tgctgcgcat cttttaaacc gaaataggca cgagtcggct tgactagatt 2400 

gaaaagcttc gtcagtacga tcgcgacccc gtcaaaatgt ccttctcttg agcgcccgca 24 60 

taacacgtct . gtgcgtctt t ctacatgaat cgtgacattc ttttcaccgg gatacatatc 2520 

atgagcatct ggcgtaaaaa gaatatcgac tccggcgttt tctgcaagag ctgcatcccg 2580 

ctcaatatcg cgcggatatg cttcaaaatc ttcattaggg ccgaattgtg caggattcac 2640 

aaaaatactc ataataacgg cgtcgttttc ttgtcttgct ttgtctgcta aggttaaatg 2700 

cccctcatgc agaaacccca tcgtcggaac aaatccgatt gacttgccct ctgaatggta 2760 

ttgttttatg gcttctttca gctgtgaaat atcagtaatc tgtctcatct tattttcccc 2820 

cgtacaagcc gtcaagcact gtctggttca tttgaaagga atgcttttgt tcagggaaag 2880 
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cacgatgtct tacatcctga acatatccgc tgattgctgt ttcgatggtt tcatcaatgc 2940' 

gcgtatattg ctttacaaat ttaggtgttc tctcaacacc gtggccgata atatcatgat 3000 

aaacgagaac ttgtccgtcc gctttcacac cagccccgat tccaatgacc ggtatgctta 3060 

1 i i 1 

gcgtctcggc aattttggct gtgagttctg .ccggcacaca ttcpagcaca agcatcatag 3120 

ctcctgcttc ttcgcatttt atactgtctt ctattaat.tt tttggcgctt tgttcgtctt 3180 

i ■ • ' , 1 

tgccctgtac tttatagccg cccagtacgc cgactgactg cggtgtcaaa cctaagtgac 3240 

i 

tgactactgg aatgcctcca agcgtcaatg cgcgaatgga ttcaaacacg ccttctccgc 3300 

cctcaagctt cagtgcgtca gctccgcttt cctgaacgat agccgctgca tttttcagcg 3360 

* 

i i 

tatcttcctt agacaggtga taagacataa acggcatatc tgtcacaata aaggtattcg 3420 

gcgcacccct tttaacggct tttgtatgat ggatcatgtc cgcaactgtc acaccgacag 3480 

ttgaatcaag gccgaggacg accattccaa gtgaatcacc gactaaaatc atgtcaactc 3540 

i r 

ccgcttgttc agcaagttta gctgccggat aatcataagc ggtcagcatg acaatcggtt 3600 

cttcagactc cttcattttt agaaaatcca gttttgtttt catgttttct cctccttcta 3660 

gagcgtcctg ctgttgttaa gattattata ccacaccttg tagataaagt caacaacttt 3720 

ttgcaaaatt tttcaggaat tttagcagag gttgttctgg atgtagaaca aaacatcttt 3780 

ccgctcttgt gctgttagga tatctttctt ggaagctagg taggcctcga gttatggcag 3840 

ttggttaaaa ggaaacaaaa agaccgtttt cacacaaaac ggtctttttc gatttctttt 3900 

tacagtcaca gccacttttg caaaaaccgg acagcttcat gccttataac tgctgtttcg 3960 

gtcgacgatg atctgccgtt ttcttctgca agccaaaaaa ccttccgtta caacgagaag 4020 

gattcttcac tttctaaagt tcggcgagtt tcatccctct gtcccagtcc ttttttggat 4080 

caaggcagac tgctgcaatg tctatctatt ttaataatag gtgcagttcg caggcgatac 4140 

tgcccaatgg aagtatacca aaatcaacgg gcttgtacca acacattagc ccaattcgat 4200 

atcggcagaa tagatttttt taatgccttc gttcgtttct aaaagcagaa cgccttcatc 4260 

atctatacct aacgccttac cgtaaaaggt tccgtttaac gttctggctc tcatattagt 4320 

gccaataccg agcgcatagc tttcccataa aagcttaatc ggcgtaaatc cgtgcgtcat 4380 

ataatcccgg taccgtttct caaagcatag taaaatatgc tggatgacgc cggcccgatc 4440 

aattttttcc ccagcagctt ggctgaggct tgtcgcgatg tccttcaatt catctggaaa 4500 

atcattaggc tgctggttaa cgttaatgcc gatcccaatg atcactgaac gtacgcggtc 4560 

ttcttcagcc tgcatttccg ttaggatacc gactgttttt tttccgttaa tcaaaatatc 4620 
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taagcgtact gaatgtcctt cagtttcttt tcatccagtt tttcgttttg ccgaagctgc 5400 



ttccaggctt tcaggaaaag gggcgcgtct ttcaaaacta tgccaaggtt aattaaaaga 5580 

gcagcccaaa gctcctcacg ggatgttaaa gagaagaatg gaaactcact cgttgaaatc 5640 

aggttttctc gtttatgata aaaaccagga agctcttcat acaatctcgt ttgaatgagt 5700 

gtttgaagcg cctggcgaga agctcttccc tgcagcaatt tctcaaactc tatagttttt 5760 

cgttcgactg aaacatggga gaggagtgat ttttctttcg caatggcttc ttctgtttcc 5820 

ggtgaaagcg taaagccaag ctggctcata aagcgtacgg ctctcagcat acgaagcgca 5880 

tcctcttgaa atctatcctc aggctttcca acggttcgaa tcactttctg atcaatatct 5940 

ttcttgccgc caaaataatc aagcaccttc ccgtccgctg tcatggccat cgcattgatc 6000 

gttaaatctc tgcgttttag atcctcttct aatgatgaga taaattgcac ttctgacggt 6060 

cttctgaaat caacataatc agattcagtc cggaatgtcg tgacttcata ggtttcatcc 6120 

tcccagagca caataatggt cccgtgctct ttgcctacat caacagtccg ctgaaacagc 6180 

cgttctactt gatcaggtgc cgcatctgtc gcgatatcga catctccgat cgttcgtttc 6240 

atatagctgt cacgaactgc gcccccgaca aaataagcct gatggcccgc ttcgattaag 6300 

atgcggagca cgggaagtgc tttgataaaa actttttcca tgtgatcact ccggttctgc 6360 



1 1 * 

gcacaacagci 


46&0 


gccgcaa'aat 


4740 


ttcggcccct 


4800 


tattcgcgag 

r 


4860 


gctggcccat 


4920 

i 


gttttttgat 


4980 


gaagctcttc 

1 r • 


5040 


tttggccgga 


5100 


atgttgaccg 


5160 


tgtcacaaca 


5220 


gtttcgaagc 


5280 


caggttttga 


5340 


ccgaagctgc 


5400 


cattgcgtca 


5460 


tcccgggagc 


5520 


aattaaaaga 


5580 


cgttgaaatc 


5640 


ttgaatgagt 


5700 


tatagttttt 


5760 


ttctgtttcc 


5820 


acgaagcgca 


5880 


atcaatatct 


5940 


cgcattgatc 


6000 


ttctgacggt 


6060 


ggtttcatcc 


6120 


ctgaaacagc 


6180 


cgttcgtttc 


6240 


ttcgattaag 


6300 


ccggttctgc 


6360 
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taaatcggca taaatctgtt catactggct gacaattttt ttagaagaaa attcattttc 6420 

L * " I • 1 ' 1 
I ' 1 ' ' I 

aagcatctct attgccgcct ttgtaaaacg attgcttagc tgttcatctt ctaaaatgct 6480 

catcgcgcgg gctgttgcgg ccgtaacatc accgacatcc accaaaaatc cgctcacatt 6540 

gttttttata acctcaggga taccgccaat gtttgttcca atacaaggca ctccgcaagc 6600 

' . i> . . • i 

catcgcttca agcaggacaa ggccaaagct ttctttttca gatagcagca gcttcaaatc 6660 

gctaata^a'a taaagatctt caacacggtc ttg ( atttcca agcattaaga pttggtcttc 6720 

caagccatat tt.tctgataa gctcgcaggc , tgt cgatttd tccggaccgt ctccgactaa 6780 

. ■ * 

aagcagcttc gctttcgttt tgccagcgat attgcggaac acacggatga catcctgcac 6840 

gcgtgcagcc actggtaaca ggattagcag agcga,ggtat gtaggcggtg ctacagagtt 6900 

cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta tctgcgctct 6960 

gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac 7020 

cgctggtagc ggtrggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc 7080 

tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg 7140 

ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta 7200 

aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca 7260 

atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc 7320 

ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc 7380 

tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc 7440 

agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat 7500 

taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt 7560 

tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc 7620 

cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag 7680 

ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt 7740 

tatggoagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac 7800 

tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg 7860 

cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat 7920 

' tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc 7980 

gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc 8040 

tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa 8100 
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atgttgaata ctcatactct tcctttttca. atattattga agcatttat'c agggttattg 8160 

i 

• • i 

tctcatgagc gigatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg 8220 

cacatttccc cgaaaag'tgc cacctgacgt ctaagaaacc attattatca tgacattaac 8280 

ctataaaaat aggcgtatca cgaggccctt ( tcgtctcgca tgcggatcag tgagggtttg 8340 

• 1 ' ' ' ' . . 1 

caactgcggg tcaaggatct ggatttcgat cacggcacga tcatcgtgcg ggagggcaag 8400 

ggctccaagg atcgggcctt gatgttaccc gagagcttgg cacccagcct gcgcgagcag 84 60 

gggaattgat ccggtggatg accttttgaa tgacctttaa tagattatat tactaattaa 8520 

ttggggaccc tagaggtccc cttttttatt ^taaaaattt tttcacaaaa cggtttacaa 8580 

gcataacggg ttttgctgcc cgcaaacggg ctgttctggt gttgctagtt tgttatcaga 8640 

atcqcagatc cggcttcagg tttgccggct gaaagcgcta tttcttccag aattgccatg 8700 

attttttccc cacgggaggc gtcactggct cccgtgttgt cggcagcttt gattcgataa 8760 

i 

gcagcatcgc ctgtttcagg ctgtctatgt gtgactgttg agctgtaaca agttgtctca 8820 
ggtgttcaat ttcatgttct agttgctttg ttttactggt ttcacctgtt ctattaggtg 8880, 
ttacatgctg ttcatctgtt acattgtcga tctgttcatg gtgaacagct ttaaatgcac 8940 
caaaaactcg taaaagctct gatgtatcta tcttttttac accgttttca tctgtgcata 9000 
tggacagttt tccctttgat atctaacggt gaacagttgt tctacttttg tttgttagtc 9060 
ttgatgcttc actgatagat acaagagcca taagaacctc agatccttcc gtatttagcc 9120 
agtatgttct ctagtgtggt tcgttgtttt tgcgtgagcc atgagaacga accattgaga 9180 
tcatgcttac tttgcatgtc actcaaaaat tttgcctcaa aactggtgag ctgaattttt 9240 
gcagttaaag catcgtgtag tgtttttctt agtccgttac gtaggtagga atctgatgta 9300 
atggttgttg gtattttgtc accattcatt tttatctggt tgttctcaag ttcggttacg 9360 
agatccattt gtctatctag ttcaacttgg aaaatcaacg tatcagtcgg gcggcctcgc 9420 
ttatcaacca ccaatttcat attgctgtaa gtgtttaaat ctttacttat tggtttcaaa 9480 
acccattggt taagcctttt aaactcatgg tagttatttt caagcattaa catgaactta 9540 
aattcatcaa ggctaatctc tatatttgcc ttgtgagttt tcttttgtgt tagttctttt 9600 
aataaccact cataaatcct catagagtat ttgttttcaa aagacttaac atgttccaga 9660 
ttatatttta tgaatttttt taactggaaa agataaggca atatctcttc actaaaaact 9720 
aattctaatt tttcgcttga gaacttggca tagtttgtcc actggaaaat ctcaaagcct 9780 
ttaaccaaag gattcctgat ttccacagtt ctcgtcatca gctctctggt tgctttagct 9840 
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j aatacaccat aagcattttc cctactgatg ttcatcatct gagcgtattg gttataaigtg, 9900 

aacgataccg tccgttcttt ccttgtaggg, ttttcaa'tcg' tgcjggttgag tagtg t ccaca 9960 

» i 

cagcataaaa ttagcttggt ttcatgctcc gttaagtcat agcgactaat cgctagttca 10020 

tttgctttga aaacaactaa ttcagacata catctcaatt ggtctaggtg attttaatca 10080 

ctataccaat tgagatgggc tagtcaatga taattactag tccttttcct ttgagtt ( gtg 10140 

ggtatctgta aattctgcta gacctttgct ggaaaacttg taaattctgc tagaccctct 10200 

n i ■ * 

gtaaattccg ctagaccttt gtgtgttttt tttgtttata ttcaagtggt tataatttat 10260 

agaataaaga aagaataaaa aaagataaaa agaatagatc ccagccctgt gtataactca 10320 

ctactttagt cagttccgca gtattacaaa aggatgtcgc aaacgct'gtt tgctcctcta 10380 

caaaacagac cttaaaaccc taaaggctta agtagcaccc tcgcaagctc gggcaaatcg 10440 

i 

ctgaatattc cttttgtctc cgaccatcag gcacctgagt cgctgtcttt ttcgtgacat 10500 
tcagttcgct gcgctcacgg ctctggcagt gaatgggggt aaatggcact acaggcgcct 10560 

] 

tttatggatt catgcaagga aactacccat aatacaagaa aagcccgtca cgggcttctc 10620 
agggcgtttt atggcgggtc tgctatgtgg tgctatctga cttttcgctg ttcagcagtt 10680 
cctgccctct gattttccag tctgaccact tcggattatc ccgtgacagg tcattcagac 10740 
tggctaatgc acccagtaag gcagcggtat catcaacagg cttacccgtc ttactgtcaa 10800 
c 10801 

<210> 77 
<211> 8654 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN236 plasmid 

<400> 77 

ctcgaggcct acctagcttc caagaaagat atcctaacag cacaagagcg gaaagatgtt 60 

ttgttctaca tccagaacaa cctctgctaa aattcctgaa aaattttgca aaaagttgtt 120 

gactttatct acaaggtgtg gtataataat cttaacaaca gcaggacgct ctagaggagg 180 

agacatcatg aaaattggaa ttatcggcgg aggctccgtt ggtcttttat gcgcctatta 240 

tttgtcactt tatcacgacg tgactgttgt gacgaggcgg caagaacagg ctgcggccat 300 

tcagtctgaa ggaatccggc tttataaagg cggggaggaa ttcagggctg attgcagtgc 360 
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ggacacgagt atcaattcgg actttgacct gct.tgtcgtg acagtgaagc agcatcagct 420 

p 1 1 • * i 

tcaatctgtt ttttcgtcgc ttgaacgaat cgggaagacg aatatattat: ttttgcaaaa 480 

cggcatgggg catat'ccacg acctaaaaga ctggcacgtt ggccattcca tttatgttgg 540 

i 

aatcgttgag cacggagctg taagaaaatc ggatacagct gttgatcata caggcctagg 600 
tgcgataaaa tggagcgcgt tcgacgatgc tgaaccagac cggctgaaca tcttgtttca 660 
gqataacca/t tcggattttc cgatttatta 1 t^agacggat tggtaccgtc tgctgacggg 720 
caagctgatt gtaaatgcgt gtattaatcc tttaactgcg ttattgcaag tgaaaaatgg 780 
agaactgctg acaacgccag cttatctggc ttttatgaag ctggtatttc aggaggcatg 840 
ccgcatttta aaacttgaaa atgaagaaaa ggcttgggag cgggttcagg ccgtttgtgg 900 
gcaaacgaaa gagaatcgtt' catcaatgct ggttgacgtc attg[gaggcc ggcagacgga 960 

agctgacgcc attatcggat acttattgaa ggaagcaagt cttcaaggtc ttgatgccgt 1020 

i i * i 

ccacctagag tttttatatg gcagcatcaa agcattggag cgaaatacaa acaaagtctt 1080 
ttgagctttt tcggtaacat gctatactca tttcggatca ctaactattt attggaggat '1140 
cctgttttgg cggatgagag aagattttca gcctgataca gattaaatca gaacgcagaa 1200 
gcggtctgat aaaacagaat ttgcctggcg gcagtagcgc ggtggtccca cctgacccca 1260 
tgccgaactc agaagtgaaa cgccgtagcg ccgatggtag tgtggggtct ccccatgcga 1320 
gagtagggaa ctgccaggca tcaaataaaa cgaaaggctc agtcgaaaga ctgggccttt 1380 
cgttttatct gttgtttgtc ggtgaacgct ctcctgagta ggacaaatcc gccgggagcg 1440 
gatttgaacg ttgcgaagca acggcccgga gggtggcggg caggacgccc gccataaact 1500 
gccaggcatc aaattaagca gaaggccatc ctgacggatg gcctttttgc gtttctacaa 1560 
actcttggta cccagaaaaa gcggcaaaag cggctgttaa aaaagcgaaa tcgaagaagc 1620 
tgtctgccgc taagacggaa tatcaaaagc gttctgctgt tgtgtcatct ttaaaagtca 1680 
cagccgatga atcccagcaa gatgtcctaa aatacttgaa cacccagaaa gataaaggaa 1740 
atgcagacca aattcattct tattatgtgg tgaacgggat tgctgttcat gcctcaaaag 1800 
aggttatgga aaaagtggtg cagtttcccg aagtggaaaa ggtgcttcct aatgagaaac 1860 
ggcagctttt taagtcatcc tccccattta atatgaaaaa agcacagaaa gctattaaag 1920 
caactgacgg tgtggaatgg aatgtagacc aaatcgatgc cccaaaagct tgggcacttg 1980 
gatatgatgg aactggcacg gttgttgcgt ccattgatac cggggtggaa tggaatcatc 2040 
cggcattaaa agagaaatat cgcggatata atccggaaaa tcctaatgag cctgaaaatg 2100 
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i ) , 1 

aaatgaactg gtatgatgcc gtagcaggcg aggcaagccc ttatgatgat ttggctcatg 2160 

it i i ' 

gaacccacgt gacaggcacg atggtgggct ctgaacctga tggaa'caaat caaatcggtg 2220 

t . 1 

i 1 

tagcacctgg cgcaaaatgg attgctgtta aagcgttctc tgaagatggc ggcactgatg 2280 

i 

ctgacatttt ggaagctggt gaatgggttt tagcaccaaa ggacgcggaa ggaaatcccc 2340 

acccggaaat ggctcctgat gttgtcaata actcatggg'g agggggctct ggacttgatg 2400 

'aatggtacag agacatggtc aatgcctggc gttcggccga tattttccct gagttttcag 2460 

i cggggaatac ggatctcttt attcccggcg ggcctggttc tatcgcaaat 1 ccggcaaact 2520 

, i 

i 

atccagaatc gtttgcaact ggagcgactg agaattccaa ttccccatgg agagsla'aaga 2580 

aaatcgctaa tgttgattac tttgaacttc tgcatattct tgaatttaaa aaggctgaaa 2640 

gagtaaaaga ttgtgctgaa atattagagt ataaacaaaa tcgtgaaaca ggcgaaagaa 2700 

1 I 4 

agttgtatcg agtgtggttt tgtaaatcca ggctttgtcc aatgtgcaac tggaggagag 2760 

caatgaaaca tggcattcag tcacaaaagg ttgttgctga agttattaaa caaaagccaa 2820 

L 

I 

cagttcgttg gttgtttctc acattaacag ttaaaaatgt tLatgatggc gaagaattaa 2880 

ataagagttt gtcagatatg gctcaaggat ttcgccgaat gatgcaatat aaaaaaatta 2940 

ataaaaatct tgttggtttt atgcgtgcaa cggaagtgac aataaataat aaagataatt 3000 

cttataatca gcacatgcat gtattggtat gtgtggaacc aacttatttt aagaatacag 3060 

aaaactacgt gaatcaaaaa caatggattc aattttggaa aaaggcaatg aaattagact 3120 

atgatccaaa tgtaaaagtt caaatgattc gaccgaaaaa taaatataaa tcggatatac 3180 

aatcggcaat tgacgaaact gcaaaatatc ctgtaaagga tacggatttt atgaccgatg 3240 

atgaagaaaa gaatttgaaa cgtttgtctg atttggagga aggtttacac cgtaaaaggt 3300 

taatctccta tggtggtttg ttaaaagaaa tacataaaaa attaaacctt gatgacacag 3360 

aagaaggcga tttgattcat acagatgatg acgaaaaagc cgatgaagat ggattttcta 3420 

ttattgcaat gtggaattgg gaacggaaaa attattttat taaagagtag ttcaacaaac 3480 

gggccatatt gttgtataag tgatgaaata ctgaatttaa aacttagttt atatgtggta 3540 

aaatgtttta atcaagttta ggaggaatta attatgaagt gtaatgaatg taacagggtt 3600 

caattaaaag agggaagcgt atcattaacc ctataaacta cgtctgccct cattattgga 3660 

gggtgaaatg tgaatacatc ctattcacaa tcgaatttac gacacaacca aattttaatt 3720 

tggctttgca ttttatcttt ttttagcgta ttaaatgaaa tggttttgaa cgtctcatta 3780 

cctgatattg caaatgattt taataaacca cctgcgagta caaactgggt gaacacagcc 3840 
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tttatgttaa 


ccttttccat 


atcaiaaaggt 


tactcctatt 


gttggccatt 


ctttcttttc 


gctgcattte 


cagcactcgt 


ggtaaagcat 


ttggtcttat 


attggtggaa 


tgatagecca 


acaattatca 


ctqttccgtt 


cat t ttgata 


tcaaaggaa t 


acaacat cat 


a t agcatt tc 


aaacatatca 


ggaaagtaac 


111 a l_ U f~l L Lti 

^k*^ Nrt Bv ^4 mj^i 




atggttcctt 


atatgatgaa 


attattt tec 

i 


ctggaacaat 


gatagaagag 


gt cctt.tata 


t taactgett 


cctttcttt t 


qt tttaqqtq 


qqct t teat t 


aaacaqcagg 


aaqctgqtqc 


qqaacaggta 


ttgcaattgt 

7 7 


ttacctatgg 

7 7 


aagttgatca 


ggaatcattg 


tcattagttg 


ttctaaatcq 


ttaagggatc 


cagttcgaag 


cggccgcaat 


aggttaatgt 


catgataata 


tgegeggaac 


ccctatttgt 


gacaataacc 


ctgataaatg 


atttccgtgt 


cgcccttatt 


cagaaacget 


ggtgaaagta 


tcgaactgga 


tctcaacagc 


caatgatgag 


cacttttaaa 



I 1 
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ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac 5640 

I ' ' , . 

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 5700 

i 1 i 

' . • ■. 

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg 57 60 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac 5820 

cggagctgaa tgaagccatai ccaaacgacg 'agcgtgacac cacgatgcct gcagcaatgg 5880 

caacaacgtt gcgcaaacta ttaactggcg[ aactacttac tctagcttcc cggcaacaat 5940 

' i 1 > . . • 

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg 6000 

' i » 

4 ■ * * 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg 6060 

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 6120 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 6180 

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt 6240 

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt 6300 

i 

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt ,6360 

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag 6420 

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca 6480 

gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca 6540 

agaactctgt agcaccgcct acatacctcg ctctgctaat cctyttacca gtggctgctg 6600 

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg 6660 

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct 6720 

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga 6780 

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc 6840 

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg 6900 

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg 6960 

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgttc tttcctgcgt 7020 

tatcccctga ttctgtggat aaccgtatta ccgcctttga gtgagctgat accgctcgcc 7080 

gcagccgaac gaccgagcgc agcgagtcag tgagcgagga agcggaagag cgcctgatgc 7140 

ggtattttct ccttacgcat ctgtgcggta tttcacaccg catatggtgc actctcagta 7200 

caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 7260 

ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 7320 
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gctcccggca tccgcttaca gacaagctgt gacpgtctcc gggagctgca tgtgtcagag 7380 

i 1 i • 1 * 1 ' ' 1 

gttttcaccg tcatcaccga aacgcgcgag gcagctgc^g taaagctcaf cagcgtggtc 7440 

gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 7500 

i 

aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 75,60 

ggtcacttga tgcctccgtg taagggggaa tttctgttca tgggggtaat gataccgatg 7620 

aa,acgagaga ggatgctcac gatacgggtt 1 actgatgatg aacatgcccg gttactggaa 7680 

cgttgtgagg gtaaacaact ggcggtatgg atgcggcggg accagagaaa aatcactcag 7740 

ggtcaatgcc agcgcttcgt taatacagat gtaggtgttc cacagggtag ccagcagcat 7800 

cctgcgatgc agatccggaa cataatggtg cagggcgctg acttccgcgt ttccagactt 7860 

tacgaaacac ggaaaccgaa gaccattcat gttgttgctc aggtcgcaga cgttttgcag 7920 

cagcagtcgc ttcacgttcg ctcgcgtatc ggtgattcat tctgctaacc agtaagg.caa 7980 

ccccgccagc ctagccgggt cctcaacgac aggagcacga tcatgcgcac ccgtggccag 804 0 

gacccaacgc tgcccgagat gcgccgcgtg cggctgctgg agatggcgga cgcgatggat '8100 

atgttctgcc aagggttggt ttgcgcattc acagttctcc gcaagaattg attggctcca 3160 

atr.cttggag tggtgaatcc gttagcgagg tgccgccggc ttccattcag gtcgaggtgg 8220 

cccggctcca tgcaccgcga cgcaacgcgg ggaggcagac aaggtatagg gcggcgccta .8280 

caat'ccatgc caacccgttc catgtgctcg ccgaggcggc ataaatcgcc gtgacgatca 8340 

gcggtccagt gatcgaagtt aggctggtaa gagccgcgag cgatccttga agctgtccct 8400 

gatggtcgtc atctacctgc ctggacagca tggcctgcaa cgcgggcatc ccgatgccgc 84 60 

cggaagcgag aagaatcata atggggaagg ccatccagcc tcgcgtcggc ggccgcttcg 8520 

tcgaccgaaa cagcagttat aaggcatgaa gctgtccggt ttttgcaaaa gtggctgtga 8580 

ctgtaaaaag aaatcgaaaa agaccgtttt gtgtgaaaac ggtctttttg tttcctttta 8640 
accaactgcc ataa 8654 

<210> 78 
<211> 8093 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN4 23 plasmid 

<400> 78 

ggcggccgct tcgtcgaccg aaacagcagt tataaggcat gaagctgtcc ggtttttgca 60 
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aaagtggctg tgactgtaaa aagaaatcga aaaagaccgfc tttgtgtgaa aafcgigtpt'tt! 120 ' 
ttgtttcctt ttaaccaact gccataactc, gaggcct'acc* tagcttccaa gaaagatatc 180 
ctaacagcac aagagcggaa agatgttttg ttctacatcc agaacaacct cfcgctaaaat 240 

i ( i 

tcctgaaaaa ttttgcaaaa agttgttgac . tttatct'aca aggtgtggta taataatctt 300 

aacaacagca ggacgctcta gaaaaggagg aatttaaatg tatcgtacga tgatgagcgg 360 

caaacttcac agggcaactg ttacggaagc aaacctgaac tatgtgggaa gcattacaat 420 

tgatgaagat ctcattgatg ctgtgggaat gcttcctaat gaaaaagtac aaattgtgaa 480 

• i 

taataataat ggagcacgtc ttgaaacgta tattattcct ggtaaacggg gaagcggcgt 540 

i 

catatgctta aacggtgcag ccgcacgcct tgtgcaggaa ggagataagg tcattattat 600 

i . 

ttcctacaaa atgatgtctg atcaagaagc ggcaagccat gagccgaaag tggctgttct 660 

gaatgatcaa aacaaaattg aacaaatgct ggggaacgaa ccagcccgta caattttgta 720 

1 

aaggatcctg ttttggcgga tgagagaaga ttttcagcct gatacagatt aaatcagaac 780 

gcagaagcgg tctgataaaa cagaatttgc ctggcggcag tagcgcggtg gtcccacctg 840 

accccatgcc gaactcagaa gtgaaacgcc gtagcgccga tggtagtgtg gggtctcccc 900 

atgcgagagt agggaactgc caggcatcaa ataaaacgaa aggctcagtc gaaagactgg 960 

gcctttcgtt ttatctgttg tttgtcggtg aacgctctcc tgagtaggac aaatccgccg 1020 

ggagcggatt tgaacgttgc gaagcaacgg cccggagggt ggcgggcagg acgcccgcca 1080 

taaactgcca ggcatcaaat taagcagaag gccatcctga cggatggcct ttttgcgttt 1140 

ctacaaactc ttggtaccca gaaaaagcgg caaaagcggc tgttaaaaaa gcgaaatcga 1200 

agaagctgtc tgccgctaag acggaatatc aaaagcgttc tgctgttgtg tcatctttaa 1260 

aagtcacagc cgatgaatcc cagcaagatg tcctaaaata cttgaacacc cagaaagata 1320 

aaggaaatgc agaccaaatt cattcttatt atgtggtgaa cgggattgct gttcatgcct 1380 

caaaagaggt tatggaaaaa gtggtgcagt ttcccgaagt ggaaaaggtg cttcctaatg 1440 

agaaacggca gctttttaag tcatcctccc catttaatat gaaaaaagca cagaaagcta 1500 

ttaaagcaac tgacggtgtg gaatggaatg tagaccaaat cgatgcccca aaagcttggg 1560 

cacttggata tgatggaact ggcacggttg ttgcgtccat tgataccggg gtggaatgga 1620 

atcatccggc attaaaagag aaatatcgcg gatataatcc ggaaaatcct aatgagcctg 1680 

aaaatgaaat gaactggtat gatgccgtag caggcgaggc aagcccttat gatgatttgg 1740 

ctcatggaac ccacgtgaca ggcacgatgg tgggctctga acctgatgga acaaatcaaa 1800 
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'tcggtgtagc acctggcgca aaatggattg ctgttaaagc gttctctgaa gat'ggcggcai 18'60 

ctgatgctga cattttggaa gctggtgaat- gggttttagc acpaaaggac gcggaaggaa 1920 

atccccaccc ggaaatggct cctgatgttg tcaataactc atggggaggg ggctctggac 1980 

i . + 

\ 

ttgatgaatg gtacagagac atggtcaatg cctggcgttc ggccgatatt ttecctgagt 2040 

tttcagcggg gaatacggat ctctttattc ccggcgggcc tggttctatc gcaaatccgg 2100 

1 ■ ■ • 1 ' ■ i 

caaactatcc agaatcgttt gcaactggag cgactgagaa ttccaattcc ccatggagag 2160 

* j * ■ 

aaaagaaaat cgctaatgtt gattactttg aacttctgca tattot tgaa tttaaaaagg 2220 

; I I' 1 

I 

\ 1 

ctgaaagagt aaaagattgt gctgaaatat tagagtataa acaaaatcgt gaaacaggcg 2280 

aaagaaagtt gtatcgagtg tggttttgta aatccaggct ttgtccaatg tgcaactgga 2340 

ggagagcaat gaaacatggc attcagtcac aaaaggttgt tgctgaagtt attaaacaaa 2400 

agccaacagt tcgttggttg tttctcacat taacagttaa aaatgtttat gatggcgaag 2460 

i ! 

| i 

aattaaataa gagtttgtca gatatggctc aaggatttcg ccgaatgatg caatataaaa 2520 

aaattaataa aaatcttgtt ggttttatgc gtgcaacgga agtgacaata aataataaag 2580 

ataattctta taatcagcac atgcatgtat tggtatgtgt ggaaccaact tattttaaga ■ 2640 

atacagaaaa ctacgtgaat caaaaacaat ggattcaatt ttggaaaaag gcaatgaaat 2700 

tagactatga tccaaatgta aaagttcaaa tgattcgacc gaaaaataaa tataaatcgg 2760 

atatacaatc ggcaattgac gaaactgcaa aatatcctgt aaaggatacg gattttatga 2820 

ccgatgatga agaaaagaat ttgaaacgtt tgtctgattt ggaggaaggt ttacaccgta 2880 

aaaggttaat ctcctatggt ggtttgttaa aagaaataca taaaaaatta aaccttgatg 2940 

acacagaaga aggcgatttg attcatacag atgatgacga aaaagccgat gaagatggat 3000 

tttctattat tgcaatgtgg aattgggaac ggaaaaatta ttttattaaa gagtagttca 3060 

acaaacgggc catattgttg tataagtgat gaaatactga atttaaaact tagtttatat 3120 

gtggtaaaat gttttaatca agtttaggag gaattaatta tgaagtgtaa tgaatgtaac 3180 

agggttcaat taaaagaggg aagcgtatca ttaaccctat aaactacgtc tgccctcatt 3240 

attggagggt gaaatgtgaa tacatcctat tcacaatcga atttacgaca caaccaaatt 3300 

ttaatttggc tttgcatttt atcttttttt a.gcgtattaa atgaaatggt tttgaacgtc 3360 

tcattacctg atattgcaaa tgattttaat aaaccacctg cgagtacaaa ctgggtgaac 3420 

acagccttta tgttaacctt ttccattgga acagctgtat atggaaagct atctgatcaa 3480 

ttaggcatca aaaggttact cctatttgga attataataa attgtttcgg gtcggtaatt 3540 
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gggtttgttg 


gccattcttt 

* 


cttttcctta 


ettattatgg 


ggtgcagctg 


catttccagc 


actcgtaatg 


i , . 

gttgtagttg 


aataggggta 


aagcatttgg 


tcttattgga 


tcgatagtag 


ccagcgattg 


gtggaatgat 

i 


agcccattat 

i 


attcattggt 


atgataacaa 


ttatcactgt 


tccgtttctt 


* ** 

atgaaattat 


aaaggtcatt 


ttgatajtcaa 


■ i 

aggaatt'ata 


ctaatgtctg 


ttgtttacaa 


catcatatag 


catttctttt 


cttatcgtta 


tttgtaaaac 


i 

atatcaggaa 


agtaacagat 


ccttttgttg 


atacctttta 


tgattggagt 

t 


i 

tctttgtggg 


ggaattatat 


gtctctaftgg 


ttcctt atat 


gatgaaagat 


gttcaccagc 


agtgtaatta 


ttttccctgg 


aacaatgagt 


gtcattattt 


cttgttgata 


gaagaggtcc 


tt tatacgtg 


ttaaacatcg 


agctt tttaa 


ctgcttcctt 


tcttttagaa 


acaacatcat 


gtatttgttt 


taggtgggct 


ttcgttcacc 


aaaacagt ta 


agcttgaaac 


agcaggaagc 


tggtgctgga 


atgagtttgc 


tcagagggaa 


caggtattgc 


aattgtaggt 


ggtttattat 


aggttgttac 


ctatggaagt 


tgatcagtca 


acttatctgt 


ttttcaggaa 


tcattgtcat 


tagttggctg 


gttaccttga 


agggatttct 


aaatcgttaa 


gggatcaact 


ttgggagaga 


ttataacagt 


tcgaagcggc 


cgcaattctt 


gaagacgaaa 


ttttataggt 


taatgtcatg 


ataataatgg 


tttcttagac 


gaaatgtgcg 


cggaacccct 


atttgtttat 


ttttctaaat 


tcatgagaca 


ataaccctga 


t aaat gcttc 


aataatattg 


ttcaacattt 


ccgtgtcgcc 


cttattccct 


tttttgcggc 


ctcacccaga 


aacgctggtg 


aaagtaaaag 


atgctgaaga 


gttacatcga 


actggatctc 


aacagcggta 


agatccttga 


gttttccaat 


gatgagcact 


tttaaagttc 


tgctatgtgg 


acgccgggca 


agagcaactc 


ggtcgccgca 


tacactattc 


actcaccagt 


cacagaaaag 


catcttacgg 


atggcatgac 
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etcgttttat 


tcaaggggct 


3600 


cgcgctatat 


tccaaaggaa 

• 


3660 


ccatgggaga 

i 


aggagtcggt 


3720 


cctatcttct 


actcattcct 

i 


3780 


taaagaaag'a 


agtaaggata 


3840 


taggcattgt 


attttttatg 


3900 


gcgtgctgtc 


attcctgata 


3960 


atcccggatt 


agggaaaaat 


4020 


ttg'gaacagt 


agcagggttt 


4080 


taagtactgc 


i 

cgaaatcgga 


4140 


tcggctacat 


tggtgggata 


4200 


gagttacatt 


tctttctgtt 


4260 


ggttcatgac 


i 

aattataatc 


4320 


tatcaacaat 


tgtttcaagt 


4 380 


ttaactttac 


cagcttttta 


4440 


ccataccctt 


acttgatcaa 


4500 


atagtaattt 


gttattacrt 


4560 


atgtatataa 


acattctcaa 


4620 


gttcaaaatt 


gatccttttt 


4680 


gggcctcgtg 


atacgcctat 


4740 


gtcaggtggc 


acttttcggg 


4800 


acattcaaat 


atgtatccgc 


4 8 60 


aaaaaggaag 


agtatgagta 


4920 


attttgcctt 


cctgtttttg 


4980 


tcagttgggt 


gcacgagtgg 


5040 


gagttttcgc 


cccgaagaac 


5100 


cgcggtatta 


tcccgtattg 


5160 


tcagaatgac 


ttggttgagt 


5220 


agtaagagaa 


ttatgcagtg 


5280 
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! ctgccataac catgagtgat aacactgcgg ccaactt'act' tctgacaacg atcggagigaci 5340 
cgaaggagct aaccgctttt ttgcacaaca. tggggga'tca tgliaactcgc cttga,tctjtt 5400 

i * 

gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgcag 5460 

1 ; J 

I 

caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc 5520 

1 aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc 5580 

1 ■ ■ • i i 

ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta 5640 
tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg 5700 
ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga 5760 
ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac 5820 

i ' 

ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa 5880 

tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat- 5940 

I 

i 

cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc 6000 
taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg 6060 
gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc 612C 
acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg 6180 
ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg 6240 
ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa 6300 
cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg 6360 
aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga 6420 
gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct 6480 
gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg. aaaaacgcca 6540 
gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgttctttc 6600 
ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga gctgataccg 6660 
ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg gaagagcgcc 6720 
tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata tggtgcactc 6780 
tcagtacaat ctgctctgat gccgcatagt taagccagta tacactccgc tatcgctacg 6840 
tgactgggtc atggctgcgc cccgacaccc gccaacaccc gctgacgcgc cctgacgggc 6900 
ttgtctgctc ccggcatccg cttacagaca agctgtgacc gtctccggga gctgcatgtg 6960 
tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag ctgcggtaaa gctcatcagc 7020 
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gtggtcgtga 


agcgattcac 

i 


agatgtctgc 


ctgttcatcc 


gcgtccagct. 


cgttgagttt 


7080 


ctccagaagc 


gttaatgtct 


ggctt ctgat 


aaagcggcjcc 


atgttaaggg 


cggttt.tttc 


7140 


ctgtttggtc 


acttgatgcc 


tccgtgtaag 


ggggaatttc 


tgttcatggg 


ggtaatgata 


7200 


ccgatgaaac 


gagagaggat 

i 


gctcacgata 


cgggttactg 


atgatgaaca 


tgcccggtta 


7260 


ctggaacgtt 


gtgagggtaa 


acaactggcg 


gtattjgatgc 


■ ■ i 
ggcgggacc'a 


gagaaaaatc 


7320 


actcagggtc 


aatgccagcg 


\ • i 

cttcgttaat 


acagatgtag 


gtgttccaca 


gggtagccag 


7380 


cagcatcctg 


V t 

cgatgcagat 


ccggaacata 


i 1 

atggtgcagg 


gcgctgactt 


ccgcgtttcc 


7440 


agactttacg 


aaacacggaa 


accgaagacc 


attcatgttg 


ttgctcaggt 


cgcagacgtt 


7500 


ttgcagcagc 


agtcgcttca 


cgttcgctcg 


cgtatcggtg 


attcattctg 


ctaaccagta 


7560 


aggcaacpcc 


gccagcctag 


ccgggtcctc 


aacgacagga 


gcacgatcat 


gcgcacccgt 


7620 


ggccaggacc 


caacgctgcc 


cgagatgcgc 


cgcgtgcggc 


tgctggagat 


ggcggacgcg 

i 


7680 


atggatatgt 


t ctgccaagg 


gttggtttgc 


gcattcacag 


ttctccgcaa 


gaattgattg 


7740 


actccaattc 


t tqqaqtqqt 


gaat ccgtta 


gcgaqqtqcc 


gccggctt cc 


attcaggtcg 


7800 


aggtggcccg 


gctccatgca 


ccgcgacgca 


acgcggggag 


gcagacaagg 


tatagggcgg 


7860 


cgcctacaat 


ccatgccaac 


ccgttccatg 


tgctcgccga 


ggcggcat aa 


atcgccgtga 


7920 


cgatcagcgg 


tccagtgatc 


gaagttaggc 


tggtaagagc 


cgcgagcgat 


ccttgaagct 


7980 


gtccctgatg 


gtcgt catct 


acctgcctgg 


acagcatggc 


ctgcaacgcg 


ggcatcccga 


8040 


t gccgccgga 


agcgagaaga 


atcataatgg 


ggaaggccat 


ccagcctcgc 


gtc 


8093 



<210> 79 
<211> 8098 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN429 plasmid 

r 

<400> 79 

ggcggccgct tcgtcgaccg aaacagcagt tataaggcat gaagctgtcc ggtttttgca 60 
aaagtggctg tgactgtaaa aagaaatcga aaaagaccgt tttgtgtgaa aacggtcttt 120 
ttgtttcctt ttaaccaact gccataactc gaggcctacc tagcttccaa gaaagatatc 180 
ctaacagcac aagagcggaa agatgttttg ttctacatcc agaacaacct ctgctaaaat 240 
tcctgaaaaa ttttgcaaaa agttgttgac tttatctaca aggtgtggta taataatctt 300 
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aacaacagca ggacjgctcta gattagaaag gaggtttaat taatgtatcg tacgatga.tg 360 
agcggcaaac ttcacagggc aactgttacg gaagcaaacc tgaactatgt gggaagcatt 420 
acaattgatg aagatctcat tgatgctgtg ggaatgcttc ctaatgaaaa agtacaaatt 480 

gtgaataata ataatggagc acgtcttgaa acgtatatta ttcctggtaa acggggaagc 540 

i ■ 

ggcgtcatat gcttaaacgg tgcagccgca cgccttgtgc aggaaggaga taaggtcatt 600 
attatttcct acaaaatgat gtctgatca'a' gaagcggcaa gccatgagcc gaaagtggct 660 

! , 

gttctgaatg atcaaaacaa aattgaacaa atgctggtjga acgaaccagc ccgtacaatt 720 

^ , t ' * 

i * 

ttgtaaagga tcctgttttg gcggatgaga gaagattttc agcctgatac agattaaatc 780 

I i 

agaacgcaga agcggtctga taaaacagaa tttgcctggc ggcagtagcg cggtggtccc 840 

acctgacccc atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc 900 

tccccatgcg agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag 960 

actgggcctt tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc 1020 

cgccgggagc ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc 1080 

cgccataaac tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg 1140 

cgtttctaca aactcttggt acccagaaaa agcggcaaaa gcggctgtta aaaaagcgaa 1200 

atcgaagaag ctgtctgccg ctaagacgga atatcaaaag cgttctgctg ttgtgtcatc 1260 

tttaaaagtc acagccgatg aatcccagca agatgtccta aaatacttga ac£icccagaa 1320 

agataaagga aatgcagacc aaattcattc ttattatgtg gtgaacggga ttgctgttca 1380 

tgcctcaaaa gaggttatgg aaaaagtggt gcagtttccc gaagtggaaa aggtgcttcc 1440 

taatgagaaa cggcagcttt ttaagtcatc ctccccattt aatatgaaaa aagcacagaa 1500 

agctattaaa gcaactgacg gtgtggaatg gaatgtagac caaatcgatg ccccaaaagc 1560 

ttgggcactt ggatatgatg gaactggcac ggttgttgcg tccattgata ccggggtgga 1620 

atggaatcat ccggcattaa aagagaaata tcgcggatat aatccggaaa atcctaatga 1680 

gcctgaaaat gaaatgaact ggtatgatgc cgtagcaggc gaggcaagcc cttatgatga 1740 

tttggctcat ggaacccacg tgacaggcac gatggtgggc tctgaacctg atggaacaaa 1800 

tcaaatcggt gtagcacctg gcgcaaaatg gattgctgtt aaagcgttct ctgaagatgg 1860 

cggcactgat gctgacattt tggaagctgg tgaatgggtt ttagcaccaa aggacgcgga 1920 

aggaaatccc cacccggaaa tggctcctga tgttgtcaat aactcatggg gagggggctc 1980 

tggacttgat gaatggtaca gagacatggt caatgcctgg cgttcggccg atattttccc 2040 
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tgagttttca gcggggaata 

i 

tccggcaaac tatccagaat 
gagagaaaag aaaatcgcta 
aaaggctgaa agagtaaaag 
aggcgaaaga aagttgtatc 

i 

'ctggaggaga gcaatgaaac 
, acaaaagcca acagttcgtt 
cgaagaatta aataagagtt 
taaaaaaatt aataaaaatc 
taaagataat tcttataatc 
taagaataca gaaaactacg 
gaaattagac tatgatccaa 

F 

atcggatata caatcggcaa 
tatgaccgat gatgaagaaa 
ccgtaaaagg ttaatctcct 
tgatgacaca gaagaaggcg 
tggattttct attattgcaa 
gttcaacaaa cgggccatat 
tatatgtggt aaaatgtttt 
gtaacagggt tcaattaaaa 
tcattattgg agggtgaaat 
aaattttaat ttggctttgc 
acgtctcatt acctgatatt 
tgaacacagc ctttatgtta 
atcaattagg catcaaaagg 
taattgggtt tgttggccat 
gggctggtgc agctgcattt 
aggaaaatag gggtaaagca 
tcggtccagc gattggtgga 
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cggat ct ctt 


tat tcccggc 

t 


^ta. jt^m 

gggcctggtt 


ctat cgcaaa 


11 Aft 
2 1UU 

t 


cgtttgcaac 


1 * 

tggagcgact 


gagaattcca 


i 

aticcccatg 


21 60 


i * 

atgttgatta 


ctttgaactt 


ct gcatattc 


ttgaatttaa 


2220 


attgtgctga 


aat att agag 

t 


tataaacaaa 


atcgtgaaac 


2280 


gagtgtggtt 

i 


* 

ttgtaaatcc 


i 

aggctttgtc 


caatgtgcaa 


2340 


t 

atggcattca 


gtcacaaaag 


gttgttgctg 


aagttattaa 


2400 


i 

ggttgtttct 


cacattaaca 

i 


gttaaaaatg 


tttatgatgg 


2460 


tgtcagatat 


ggctcaagga 


tttcgccgaa 


tgatgcaata 


2520 


ttgt tggttt 


tatgcgtgca 


acggaagtga 


caataaataa 


2580 


agcacatgca 

I i 


tgtattggta 


tgtgtggaac 


caacttattt 


2640 


tgaatcaaaa 


acaatggat t 


caattttgga 


aaaaggcaat 


2700 


'. i 

atgtaaaagt 


tcaaatgatt 


cgaccgaaaa 


ataaatataa 


2760 


ttgacgaaac 


tgcaaaatat 


cctgtaaagg 


atacggattt 


2820 


agaatttgaa 


acgtttgtct 


gatttggagg 


aaggt ttaca 


2880 


atggtggttt 


gttaaaagaa 


atacataaaa 


aattaaacct 


2940 


atttgattca 


tacagatgat 


gacgaaaaag 


ccgatgaaga 


3000 


tgtggaattg 


ggaacggaaa 


aat tatttta 


ttaaagagta ' 


3060 


tgt tgtataa 


gtgatgaaat 


actgaattta 


aaacttagtt 


3120 


aatcaagttt 


aggaggaatt 


aattatgaag 


tgtaatgaat 


3180 


gagggaagcg 


tat cattaac 


cctataaact 


acgtctgccc 


3240 


gtgaatacat 


cctattcaca 


atcgaattta 


cgacacaacc 


3300 


attttatctt 


tttttagcgt 


attaaatgaa 


atggttttga 


33 60 


gcaaatgatt 


ttaataaacc 


acctgcgagt 


acaaactggg 


o4 20 


acct t tt cca 


t t ggaacagc 


tgtatatgga 


■■k -*- r B>*t b*b^ ^bh 

aagct at c tg 




ttactcctat 


ttggaattat 


aataaattgt 


ttcgggtcgg 


3540 


tctttctttt 


ccttacttat 


tatggctcgt 


tttattcaag 


3600 


ccagcactcg 


taatggttgt 


agt tgcgcgc 


tatattccaa 


3660 


tttggtctta 


ttggatcgat 


agtagccatg 


ggagaaggag 


3720 


atgatagccc 


attatattca 


ttggtcctat 


cttctactca 


3780 



I 
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ttcctatqat aacaattatc actgttccgt ttcttatgaa attattaaag aaagaagtaa 3840 

i 

ggataaaagg tcattttgat atcaaaggaa ttatactaat gtctgtaggc attgtatttt 3900 

' i 1 

ttatgttgtt tacaacatca tatagcattt cttttcttat cgttagcgtg ctgtcattcc 3960 

tgatatttgt aaaacatatc aggaaagtaa cagatccttt tgttgatccc ggattaggga 4020 

t ' 1 ' 1 

i 

aaaatatacc ttttatgatt ggagttcttt gtgggggaat tatatttgga acagtagcag 4080 

III ' | 

ggtttgtctc tatggttcct tatatgatga aagatgttc.a ccagctaagt actgccgaaa 414'0 
tcg'gaagtgt aattattttc cctggaacaa tgagtgtcat tattttcggc tacattggtg 4200 
ggatacttgt tgatagaaga ggtcctttat acgtgttaaa catcggagtt acatt't'cttt 4260 
ctgttagctt tttaactgct tcctttcttt tagaaacaac atcatgg.ttc atgacaatta 4320 
taatcgtatt tgttttaggt gggctttcgt tcaccaaaac agttatatca acaattgttt 4380 
caagtagctt gaaacagcag gaagctggtg ctggaatgag tttgcttaac tttaccagct 4440 
ttttatcaga gggaacaggt attgcaattg taggtggttt attatccata cccttact;tg 4500 
atcaaaggtt gttacctatg gaagttgatc agtcaactta tctgtatagt aatttgttat 4560 
tacttttttc aggaatcatt gtcattagtt ggctggttac cttgaatgta tataaacatt 4620 
ctcaaaggga tttctaaatc gttaagggat caactttggg agagagttca aaattgatcc 4680 
tttttttata acagttcgaa gcggccgcaa ttct tgaaga cgaaagggcc tcgtgatacg 4740 
cctattttta taggttaatg tcatgataat aatggtttct tagacgtcag gtggcacttt 4800 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 4860 
tccgctcatg agacaataac cctgataaat gcttcaataa tattgaaaaa ggaagagtat 4920 
gagtattcaa catttccgtg tcgcccttat tccctttttt gcggcatttt gccttcctgt 4980 
ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg 5040 
agtgggttac atcgaactgg atctcaacag cggtaagatc cttgagagtt ttcgccccga 5100 
agaacgtttt ccaatgatga gcacttttaa agttctgcta tgtggcgcgg tattatcccg 5160 
tattgacgcc gggcaagagc aactcggtcg ccgcatacac tattctcaga atgacttggt 5220 
tgagtactca ccagtcacag aaaagcatct tacggatggc atgacagtaa gagaattatg 5280 
cagtgctgcc ataaccatga gtgataacac tgcggccaac ttacttctga caacgatcgg 5340 
aggaccgaag gagctaaccg cttttttgca caacatgggg gatcatgtaa ctcgccttga 5400 
tcgttgggaa ccggagctga atgaagccat accaaacgac gagcgtgaca ccacgatgcc 54 60 
tgcagcaatg gcaacaacgt tgcgcaaact attaactggc gaactactta ctctagcttc 5520 
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ccggcaacaa ttaatagact ggatggaggc ggataaagtt gcaggac'cac ttctgcgctc 5580 

! | • . •• • . • • ■ • • 

ggcccttccg gctggctggt ttattgctga' taaatctgga gccggtgagc gtgggtctcg 5640 

> ' t 

' ' ' ! • • 

t 

cggtatcatt gcagcactgg ggccagatgg taagccctcc cgtatcgtag ttatctacac 5700 

i 

gacggggagt caggcaacta tggatgaacg aaatagacag atcgctgaga taggtgcctc 5760 

i 1 ' 1 

actgattaag cattggtaac tgtcagacca agt.ttactca tatatactt,t agattgattt 5820 

h i 
■ t 

aaaacttcat ttttaattta aaaggat'cta ggtgaagatc ctttttgata atctcatgac 5880 
caaaat'ccct taacgtgagt tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa 5940 

aggatcttct tgagatcctt tttttctgcg cgta'atptgc tgcttgcaaa caaaaaaacc 6000 

i 

i • • 

accgctacca gcggtggttt gtttgccgga tcaagagcta ccaactcttt ttccgaaggt 6060 

i 

aactggcttc agcagagcgc agataccaaa tactgtcctt cta^tgtagc cgtagttagg 6120 
ccaccacttc aagaactctg tagcaccgcc tacatacctc gctctgctaa tcctgttacc 6180 
agtggctgct gccagtggcg ataagtcgtg tcttaccggg ttggactcaa gacgatagtt 6240 
accggataag gcgcagcggt cgggctgaac ggggggttcg tgcacacagc ccagcttgga > 6300 
gcgaacgacc tacaccgaac tgagatacct acagcgtgag ctatgagaaa gcgccacgct 6360 
tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc agggtcggaa caggagagcg 6420 
cacgagggag cttccagggg gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca 6480 
cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa 6540 
cgccagcaac gcggcctttt tacggttcct ggccttttgc tggccttttg ctcacatgtt 6600 
ctttcctgcg ttatcccctg attctgtgga taaccgtatt accgcctttg agtgagctga 6660 
taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca gtgagcgagg aagcggaaga 6720 
gcgcctgatg cggtattttc tccttacgca tctgtgcggt atttcacacc gcatatggtg 6780 
cactctcagt acaatctgct ctgatgccgc atagttaagc cagtatacac tccgctatcg 6840 
ctacgtgact gggtcatggc tgcgccccga cacccgccaa cacccgctga cgcgccctga 6900 
cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc 6960 
atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga ggcagctgcg gtaaagctca 7020 
tcagcgtggt cgtgaagcga ttcacagatg tctgcctgtt catccgcgtc cagctcgttg 7080 
agtttctcca gaagcgttaa tgtctggctt ctgataaagc gggccatgtt aagggcggtt 7140 
ttttcctgtt tggtcacttg atgcctccgt gtaaggggga atttctgttc atgggggtaa 7200 
tgataccgat gaaacgagag aggatgctca cgatacgggt tactgatgat gaacatgccc 7260 
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ggttactgga 


acgttgtgag 

' i 


ggtaaacaac tggpggtatg cjatgcggcgg 


gaccagagaa 


7320 


aaatcactca 


gggtcaatgc 


cagcgcttcg 


ttaataca^a 

* 


tgtaggtgtt 


ccacagggta 


7380 


gccagcagca 


tcctgcgatg 


cagatccgga 


acataatggt 


gcagggcgct 


gacttccgcg 


7440 


tttccagact 


ttacgaaaca 


cggaaaccga 


agaccattca 


i 

tgttgttgct 


caggtcgcag 

• 


7500 


acgttttgca 


t 

gcagcagtcg 


cttcacgttc 


gctcgcgtat 


cggtgattca 


ttctgctaac 


7560 


ca.gtaaggca 


accccgccag 


cctagccggc/ 


tcctcaacga 


caggagcacg 


atcatgcgca 


7620 


cccgtggcca 


ggacccaacg 

i 

i 


ctgcccgaga 


tgcgccgcgt 


gcggctgctg 


gagatggcgg 


7680 


acgcgatgga 


tatgttctgc 


caagggttgg 


tttgcgcatt 


cacagttctc 


cgcaagaatt 


7740 


gattggctcc 


< 

aattcttgga 


gtggtgaatc 


i 

cqttagcgag 

i 


gtgccgccgg 


cttccattca 


7800 


ggtcgagQtg 


gcccggctcc 


atgcaccgcg 


acgcaacgcg 


gggaggcaga 


Caaygto Lag 


i a fin 


ggcggcgcct 

i 


acaatccatg 


ccaacccgtt 


ccatg.tgctc 


gccgaggcgg 


cataaatcgc 


7920 


cgtgacgatc 


agcggtccag 


tgatcgaagt 


taggctggta 


agagccgcga 


gcgatccttg 


7980 


aagctgtccc 


tgatggtcgt 


catctacctg 


cctggacagc 


atggcctgca 


acgcgggcat 


'8040 


cccgatgccg 


ccggaagcga 


gaagaatcat 


aatggggaag 


gccatccagc 


ctcgcgtc 


8098 



<210> 80 
<211> 4450 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN4 4 3 plasmid 



<400> 80 
ccatcgaatg 


gccagatgat 


taattcctaa 


tttttgttga 


cactctatca 


ttgatagagt 


60 


tattttacca 


ctccctatca 


gtgatagaga 


aaagtgaaat 


gaatagttcg 


acaaaaatct 


120 


agattagaaa 


ggaggattta 


aatatgagac 


agattactga 


tatttcacag 


ctgaaagaag 


180 


ccataaaaca 


ataccattca 


gagggcaagt 


caatcggatt 


tgttccgacg 


atggggtttc* 


240 


tgcatgaggg 


gcatttaacc 


ttagcagaca 


aagcaagaca 


agaaaacgac 


gccgttatta 


300 


tgagtatttt 


tgtgaatcct 


gcacaattcg 


gccctaatga 


agattttgaa 


gcatatccgc 


360 


gcgatattga 


gcgggatgca 


gctcttgcag 


aaaacgccgg 


agtcgatatt 


ctttttacgc 


420 


cagatgctca 


tgatatgtat 


cccggtgaaa 


agaatgtcac 


gattcatgta 


gaaagacgca 


480 


cagacgtgtt 


atgcgggcgc 


tcaagagaag 


gacattttga 


cggggtcgcg 


atcgtactga 


540 


cgaagctttt 


caatctagtc 


aagccgactc 


gtgcctattt 


cggtttaaaa 


gatgcgcagc 


600 
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atgagcggca aacttcacag ggcaactgtt acggaagcaa acctgaacta tgtgggaagc 1080 



agcgcttgga gccacccgca gttcgaaaaa taataagctt gacctgtgaa gtgaaaaatg 1500 
gcgcacattg tgcgacattt tttttgtctg ccgtttaccg ctactgcgtc acggatctcc 1560 
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 1620 
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 1680 
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 1740 
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 1800 
catcgccctg atagacggtt ttt.cgccctt tgacgttgga gtccacgttc tttaatagtg 1860 
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 1920 
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 1980 
acgcgaattt taacaaaata ttaacgctta caatttcagg tggcactttt cggggaaatg 2040 
tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga 2100 
gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac 2160 
atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc 2220 
cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca 2280 
tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc 2340 



ttggttjcdtg, 


660 


tacttaacag 


720 


gcggaacttg 


780 


atcattgaaa 


840 


9 

gagcctgtga 


900 


gcgcgtttaa 


960 


tcgtacgatg 

1 i' . 


1020 


tgtgggaagc 


1080 


aaaagtacaa 


1140 


taaacgggga 


1200 


agataaggtc 


1260 


gccgaaagtg 


1320 


agcccgtaca 


1380 


ccatggtctc 


1440 


gtgaaaaatg 


1500 


acggatctcc 


1560 


agcgtgaccg 


1620 


tttctcgcca 


1680 


ttccgattta 


1740 


cgtagtgggc 


1800 


tttaatagtg 


1860 


tttgatttat 


1920 


caaaaattta 


1980 


cggggaaatg 


2040 


ccgctcatga 


2100 


agtattcaac 


2160 


tttgctcacc 


2220 


gtgggttaca 


2280 


gaacgttttc 


2340 
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caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcpcgt attgacgccg 2400 

1 » 

i * i 

ggcaagagca a'ctcggtcgc cgcatacact attctcagaa tgacttggt.t gagtactcac 2460 
cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca 2520 
taaccatgag tgataacact gcggccaact i tacttctgac aacgatcgga ggaccgaagg 2580 

t 

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttcfat cgttgggaac 2640 

i i ■ • 

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg 2700 

i 

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat 2760 
tgatagactg gatggaggcg gataaagttg paggaccact tctgcgctcg gcccttccgg 2820 

r 

ctggctggtt tattgctgat aaatctggag ccggtgagcg tggctctcgc ggtatcattg 2880 

, i .i 
cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc 2940 

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc 3000 

attggtagga attaatgatg tctcgtttag ataaaagtaa agtgattaac agcgcattag 3060 

agctgcttaa tgaggtcgga atcgaaggtt taacaacccg taaactcgcc cagaagctag 3120, 

. gtgtagagca gcctacattg tattggcatg taaaaaataa gcgggctttg ctcgacgcct 3180 

tagccattga gatgttagat aggcaccata ctcacttttg ccctttagaa ggggaaagct 3240 

ggcaagattt tttacgtaat aacgctaaaa gttttagatg tgctttacta agtcatcgcg 3300 

atggagcaaa agtacattta ggtacacggc ctacagaaaa acagtatgaa actctcgaaa 3360 

atcaattagc ctttttatgc caacaaggtt tttcactaga gaatgcatta tatgcactca 3420 

gcgcagtggg gcattttact ttaggttgcg tattggaaga tcaagagcat caagtcgcta 3480 

aagaagaaag ggaaacacct actactgata gtatgccgcc attattacga caagctatcg 3540 

aattatttga tcaccaaggt gcagagccag ccttcttatt cggccttgaa ttgatcatat 3600 

gcggattaga aaaacaactt aaatgtgaaa gtgggtctta aaagcagcat aacctttttc 3660 

cgtgatggta acttcactag tttaaaagga tctaggtgaa gatccttttt gataatctca 3720 

tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga 3780 

tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa 3840 

aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga 3900 

aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt 3960 

taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt 4020 

taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat 4080 
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agttaccgga taaggcgcag cggtcgggct gaacggggg.g ttcgtgcaca ca'gc'ccagct, 4140 

i 

tggagcgaac gacctacacc gaactgagat acctaca'gcg' tgagctatga gaaagcgcca 4200 

cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag 4260 

< • . ■ 

agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc 4320 

r * - 

I . ■ 

gccacctctg acttgagcgt cgatttttgt gatgctcg.tc aggggggcgg agcct.atgga 4380 



aaaacgccag caacgcggcc ttttt 



acggt tcctggcctt ttgctggcct tttgctcaca 4440 



tgacccgaca i 4 450 

[ 

<210> 81 
<211> 10212 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN251 plasmid ! 



<400> 81 
gcggccgcta 


aaaagagctt 


gaggatttgc 


ggagtgaaaa 


tcagacattg 


eggaatcage 


60 


t agagatgac 


agaagaggat 


tacaaggcac 


tgatcgatat 


catggat egg 


gecagaaaaa 


120 


tggttgtttc 


gaaggaagac 


ggaagaatga 


aaaaagcggc 


t caagaaacg 


taaagaaacg 


180 


cctgaaatga 


accggcccta 


tagtaagaat 


aggccggttg 


t tttgatttc 


tatgeagact 


240 


ctcccggtgt 


catttcgcga 


tccatatcag 


gatgccagat 


gagegggtet 


tcccctttgt 


300 


cccgcgccat 


atcatactta 


acagttttaa 


agttcatttt 


gttccaaaat 


teegctgatt 


360 


tcattctcgg 


atttgtccgg 


atcggcattt 


tgaatgattt 


t gcaaattca 


accaaggctc 


420 


tcccgtatcc 


cctgttctgg 


tagcccggaa 


gaacctcaag 


cttccacagc 


tccaaataat 


480 


cctggcggtt 


gtcaaaatag 


ggattcgatt 


tgccgttaac 


ttgatacaga 


etcattegtg 


540 


ctacaagttt 


atcgccaaaa 


taaatcccgt 


aaaaaggcga 


ggtgctgtca 


ttttcaataa 


600 


tattatcctg 


aagttcttca 


agcattgaaa 


gctcctgaat 


geegtattet 


ttgaatttct 


660 


tgaatt cttc 


cagcgtttta 


tagttgataa 


gcagacgttc 


t acctttgtc 


aaacaaatct 


720 


ccccctttgt 


tgtt tctaca 


tatattgtaa 


acgctttatt 


taaaaaatcc 


aaatatttaa 


780 


actttaattt 


taagcacatg 


ggatctttga 


gaagtaattt 


cttcttact t 


ctgetatgat 


840 


aatacgtaaa 


tgcgtcgacc 


gaaacagcag 


ttataaggca 


tgaagctgtc 


cggtttttgc 


900 


aaaagtggct 


gtgactgtaa 


aaagaaatcg 


aaaaagaccg 


ttttgtgtga 


aaaeggtett 


960 
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cgaqqcct ac 


ctaqcttcca 


agaaaqatat 

1 


1020 

i 


gttctaqat c 


• 

> 

cagaacaacc 


1 , 

tctqctaaaa 

1 


1080 


ct ttatctac 


aagqtqtqqt 


ataataatct 


1140 


acatcat,gaa 


aattggaatt 


atcggcggag 


1200 


i 

t at cactt ta ' 

i ( 


tcacaacata 


act at tat era 

t 


1260 


aatctaaaad 


1 

aatccqactt 


) 

1 

tataaaaoca 


13210 


acacaaatat 


caatt caoac 

V* Ct W V»» \—4 Cl 


tttcacctac 

1* W ^ Wl Xtorf 


1380 


aatctotttt 




• 1 ,, . 

flaacaaafffl 

Cl CI Ct CI \« 


1440 


y i»- y y y y a 


tatrcacaac 

Cl C> Cl >W ^ CI Vsr 


ctaaaaaact 


1500 


tw^ <— l_ y ci y ci 


i_ y y a y »- y c* 


o y a q o ci y y 


J- <J O \J 


a a a a a ^ n 
L- y ci Laaad l_ y 


/-t -a /-t /-» n /-• /-r +- +- /-» 

y ay L.y l. l.^ 




J. D^U 


a Laou^ci c l. 


UUa L L L L ^ y 


Cl L L L O L LQ UU 


1680 


a y L u a i_ lu u 


i- y V-' y V— y l. 




17 4 0 


y Q k_ 


a ^ fnpra rift" 




1800 


U Ua L l— l_ LCOQ 




y c* ex y ci a ci cj y y 


1860 


auuL>u ci ci ci y ci 


y ci u i— wy l ci 


(■paat - rrt**t n n 


1920 


ft" ci A fnrrflt 
v» l u a ^y i» 


C3 \~ \w U u Cl C> V-^ 


l> w Cl ^- y Q d w U 


1 980 


C^A Vp^ L» Cl *^ Ct U V- 


tttatataac 


aacatcaaaa 


2040 


U CI VJ k^ I. L l^ l_ C 


u u \m. a ci c* « — w v« 


tatactcatt" 

^— Cl V- Cl * — ' C V_> Cl \_ \» 


2100 


aaattctaaa 


aaatacaact 


aact gaact t 


2160 


cactatataa 


ataacaaaqa 


aaaaatqtct 


2220 


aacatqtqqc 


gcgaaagact 


ggaagactta 


2280 


gcggcgtact 


taacctctta 


ccataataaa 


2340 


gagaagctga 


aggacccgt c 


aagt gecget 


2400 


acaggaccgc 


tttacaccat 


acataaaatc 


2460 


gaaaaggaac 


tgcaagtgcc 


tgtcatacca 


2520 


ttggatgaga 


ttaattttgt 


tcacacatct 


2580 


cctcagtctt 


attggaagaa 


atcatcagca 


2640 


gccgcgtgga 


tagatgatgt 


ttttgecget 


2700 
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tttgaagaaa 


CayaCCaLoU 

' 1 


/-t ^ 4- /— • ^ /-» 4- 4- 
y od LdLdL.1 L. 


^ 4* 4* ^ A* 

tctgtLacgi- 


' i 

4* +• a ^ <a y-^ 4~ 4— 

LLaCtyaCt L 


4> 4* 4» a a « 4-irt 

CLCiyaaCuy 


4-4- — > /-» 4- 4- 4- 4- -> 4- 

t: t agut eta u 


■ i 

l a a aLcctgg 


yyatccty tt 


tacagattaa 


atCayaaCyc 

r 




gcgcggtggt 


cccacctgac 


^Bft 4* 

cccatgccga 


gtagtgtggg 


gtctccccat 

i 


gcgagagpag 

i 


gctcagt cga 


aagactgggc 


ctttcgtttt 


agtaggacaa 


at ccgccggg 


agcggat t tg 


cgggcaggac 


gcccgccata 


aactgccagg 

i 


gatggccttt 

■ i 


ttgcgtttct 


acaaactctt 


ttaaaaaagc 


gaaatcgaag 


aagctgtctg 


ctgttgtgtc 


atctt.taaaa 


gtcacagccg 


t gaacaccca 


gaaagataaa 


ggaaatgcag 


ggattgctgt 


tcatgcct ca 


aaagaggtt.a 


aaaaggtgct 


tcctaatgag 


.aaacggcagc 


aaaaagcaca 


gaaagctatt 


aaagcaactg 


atgccccaaa 


agcttgggca 


cttggatatg 


a taccggggt 


ggaatggaat 


catccggcat 


aaaatcctaa 


tgagcctgaa 


aatgaaatga 


gcccttat ga 


tgat tt ggct 


cat ggaaccc 


ct ga tggaac 


aaatcaaatc 


gg tgiiagcac 


tctctgaaga 


l gg cggcac u 


gatigctgaca 


ca a a ggacgc 


ggaaggaaax. 


ccccacccyy 


/-V f+* ^ J^Y 

ggggaggggg 




yd ty da tyy t 


ccgatatttt 


ccctgagttt 


tcagcgggga 


gttctatcgc 


aaatccggca 


aactatccag 


ccaattcccc 


atggagagaa 


aagaaaatcg 


ttcttgaatt 


taaaaaggct 


gaaagagtaa 


aaaatcgtga 


aacaggcgaa 


agaaagttgt 



PCT/US00/25993 

-115- 

ctcgacaatg tgaaacgatg tttaagggaa 2760 

ctgatcgcgg atttgttcca agaagagggc 2820 

ttggcggatg agagaagatt ttcagcctga 2880 

tgataaaaca gaatttgcct ggcggcagta 2940 

actcagaagt gaaacgccgt agcgccgatg 3000 

ggaactgcca ggcatcaaat aaaacgaaag 3060 

i 

atctgttgtt tgtcggtgaa cgctctcctg 3120 

aacgttgcga agcaacggcc cggagggtgg 3180 

caitcaaatta agcagaaggc catcctgacg 3240 

ggtacccaga aaaagcggca aaagcggctg 3300 

■f 

ccgctaagac ggaatatcaa aagcgttctg 3360 

atgaatccca gcaagatgtc ctaaaatact 3420 

accaaattca ttcttattat gtggtgaacg . 3480 

tggaaaaagt ggtgcagttt ccogaagtgg 3540 

tttttaagtc atcctcccca tttaatatga 3600 

acggtgtgga atggaatgta gaccaaatcg 3660 

atggaactgg cacggttgtt gcgtccattg 3720 

taaaagagaa atatcgcgga tataatccgg 3780 

actggtatga tgccgtagca ggcgaggcaa 3840 

acgtgacagg cacgatggtg ggctctgaac 3900 

ctggcgcaaa atggattgct gttaaagcgt 3960 

ttttggaagc tggtgaatgg gttttagcac 4020 

aaatggctcc tgatgttgtc aataactcat 4080 

acagagacat ggtcaatgcc tggcgttcgg 4140 

atacggatct ctttattccc ggcgggcctg 4200 
aatcgtttgc aactggagcg actgagaatt 4260 

ctaatgttga ttactttgaa cttctgcata 4320 

aagattgtgc tgaaatatta gagtataaac 4380 

atcgagtgtg gttttgtaaa tccaggcttt 4440 



WO 01/21772 



gtccaatgtg 


caactggagg 

1 i 


agagcaatga 


ctgaagttat 


taaacaaaag 

1 i - 


» i 
ccaacagttc 


atqtttatqa 


tggcgaagaa 


ttaaataaga 


gaatgatgca 


atataaaaaa 

i 


attaataaaa 


tgacaataaa 


i 

taataaaqat 


aattcttata 


aaccaactta 


ttttaagaat 


acagaaaactt 


ggaaaaaggc 


* ' 

aat'gaaatta 

i 


gactatgatc 


aaaat aaata 


t aaatcggat 


atacaatcgg 


aaaatacaaa 


t t t tat aacc 


era t crataaaa 


aaaaaaottt 


acaccataaa 


aaattaatct 


aaaaattaaa 

1 


ccttoatoac 


acaaaaaaaa 

%^p1 ^pM 


aaaccaatca 


aaataaattt 


tctattatta 


V-* &4 ^» £-« t-J t»J V^j 


ataottcaac 


t*A Vrif ^p1 ^p4 Vj ^p^ ^pW wl 


ttaaaactta 


a t ttat a tat 


aataaaatat 

^pj ^P> ^P>* 


aaotataata 


aatqtaacaa 


ggtt caa tta 


actacatcta 


ccct ca t t at 


t acta cr era t aa 


ttacgacaca 


accaaatt tt 


aatttggct t 


gaaatggttt 


t gaacgtctc 


attacctgat 


agt acaaact 


gggtgaacac 


ageett t atg 


gqaaagctat 


ctgatcaatt 


agqcatcaaa 


tgtttcgggt 


cggtaattgg 


gtttgttggc 


cgttttattc 


aaggggctgg 


tqeagctgea 


cgctatattc 


caaaggaaaa 


t aggggtaaa 

^ ^ W 


atgggagaag 


gagtcggtcc 


agcgattggt 


tatcttctac 


tcatt cctat 


gataacaatt 


aagaaagaag 


taaggataaa 


aggtcatttt 


ggcattgtat 


tttttatgtt 


gtttacaaca 


gtgctgtcat 


tcctgatatt 


tgtaaaacat 


cccggattag 


ggaaaaatat 


accttttatg 
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aaqatcjgcat tcagtcacaa aaggttgttg 4500 
gttggttgtt tctcacatta acagttaaaa 4560 
gtttgtcaga tatggctcaa ggatttcgee 4 620 

i 

atcttgttgg ttttatgcgt geaaeggaag 4 680 

i* i 

atcagcacat gcatgtattg gtatgtgtgg 4740 
aegtgaatea aaaacaatgg attcaatttt 4800 
caaatgt'aaa agtt.caaatg attcgaccga 4860 * 

i 

j 

caat'tgacga aactgeaaaa tatcctgtaa 4 920 
aaaagaattt gaaacgtttg tctgatttgg 4980 
cctatggtgg tttgttaaaa gaaatacata 5040 
gcgatttgat tcatacagat gatgacgaaa 5100 
caatgtggaa ttgggaacgg aaaaattatt 5160 
tattgttgta taagtgatga aatactgaat 1 5220 
tttaatcaag tttaggagga attaattatg 5280 
aaagagggaa gegtatcatt aaccctataa 5340 
aatgtgaata catcctattc acaatcgaat 5400 
tgcattttat ctttttttag cgtattaaat 5460 
attgeaaatg attttaataa accacctgcg 5520 
ttaacctttt ccattggaac agctgtatat 5580 
aggttactcc tatttggaat tataataaat 5640 
cattctttct tttccttact tattatggct 5700 
tttccagcac tcgtaatggt tgtagttgcg 5760 
gcatttggtc ttattggatc gatagtagee 5820 
ggaatgatag cccattatat tcattggtcc 5880 
atcactgttc cgtttcttat gaaattatta 5940 
gatatcaaag gaattatact aatgtctgta 6000 
tcatatagca tttcttttct tategttage 6060 
atcaggaaag taacagatcc ttttgttgat 6120 
attgga,gttc tttgtggggg aattatattt 6180 
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ggaacagtag cagggtttgt ctctatggtt ccttatatga tgaaagatgt tcaccagcta 6240 

agtactgccg aaatcggaag tgtaattatt ttccctggaa caatgagtgt cattattttc 6300 

i ■ 1 

ggctacattg gtgggatact tgttgataga agaggtcctt .tatacgtgtt aaacatcgga 6360 

gttacatttc tttctgttag ctttttaact gcttcctttc ttttagaaac aacatcatgg 6420 

ttcatgacaa ttataatcgt atttgtttta ggtgggcttt cgttcaccaa aacagttata 6480 

'tcaacaattg. tttcaagtag cttgaaacag caggaagctg gtgctggaat gagtttgctt 6540 

. aactttacca gctttttatc agaggjgaaca ggtattgcaa ttgtaggtgg tttattatcc 6600 

i 

atacccttac ttgatcaaag gttgttacct atggaagttg atcagtcaac ttatcitigtat 6660 

agtaatttgt tattactttt ttcaggaatc attgtcatta gttggctggt taccttgaat 6720 

gtatataaac attctcaaag ggatttctaa atcgttaagg gatcaacttt gggagagagt 6780 

tcaaaattga tccttttttt ataacagttc gaagcggccg caattcttga agacgaaagg 6840 

gcctcgtgat acgcctattt ttataggtta atgtcatgat aataatggtt tcttagacgt 6900 

caggtggcac ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac 6960 

attcaaatat gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa 7020 

aaaggaagag tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat 7080 

-tttgccttcc tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc 7140 

agttgggtgc acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga 7200 

gttttcgccc cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg 7260 

cggtattatc ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc 7320 

agaatgactt ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag 7380 

taagagaatt atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc 7440 

tgacaacgat cggaggaccg aaggagctaa ccgctttttt gcacaacatg ggggatcatg 7500 

taactcgcct tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg 7560 

acaccacgat gcctgcagca atggcaacaa cgttgcgcaa actattaact ggcgaactac 7620 

ttactctagc ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac 7680 

cacttctgcg ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg 7740 

agcgtgggtc tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg 7800 

tagttatcta cacgacgggg agtcaggcaa ctatggatga acgaaataga cagatcgctg 7860 

agataggtgc ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac 7 920 
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agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 84 60 



gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 8700 

ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 8760. 

ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 8820 

aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 8880 

accgcatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta agccagtata 8940 

cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc caacacccgc 9000 

tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag ctgtgaccgt 9060 

ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg cgaggcagct 9120 

gcggtaaagc tcatcagcgt ggtcgtgaag cgattcacag atgtctgcct gttcatccgc 9180 

gtccagctcg ttgagtttct ccagaagcgt taatgtctgg cttctgataa agcgggccat 9240 

gttaagggcg gttttttcct gtttggtcac ttgatgcctc cgtgtaaggg ggaatttctg 9300 

ttcatggggg taatgatacc gatgaaacga gagaggatgc tcacgatacg ggttactgat 9360 

gatgaacatg cccggttact ggaacgttgt gagggtaaac aactggcggt atggatgcgg 9420 

cgggaccaga gaaaaatcac tcagggtcaa tgccagcgct tcgttaatac agatgtaggt 9480 

gttccacagg gtagccagca gcatcctgcg atgcagatcc ggaacataat ggtgcagggc 9540 

gctgacttcc gcgtttccag actttacgaa acacggaaac cgaagaccat tcatgttgtt 9600 

gctcaggtcg cagacgtttt gcagcagcag tcgcttcacg ttcgctcgcg tatcggtgat 9660 



atcctttttg 


7980 

i 


i 

tcagaccccg 


8040 


tgctgcttgc 


8100 


ctaceaactc 


8160 


cttctagtgt 


8220 


i 

ctcgctctgc 


8280 


qqqttggact 


8340 


tcatqc'acac 


8400 


gagctatgag 


8460 


oocaaQQ t co 


8520 


t a taat cct a 


8580 




8640 


tactaocctt 


8700 


attaccacct 


8760 


tcao t ciaa ca 


8820 


ggtatttcac 


8880 


aaccaqtata 


8940 


caacacccgc 


9000 


ctataaccat 


9060 


cgaggcagct 


9120 


gttcatccgc 


9180 


agcgggccat 


9240 


ggaatttctg 


9300 


ggttactgat 


9360 


atggatgcgg 


9420 


agatgtaggt 


9480 


ggtgcagggc 


9540 


tcatgttgtt 


9600 


tatcggtgat 


9660 
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tcraattctt 
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y Ly y vci-y y 


agacaaggta 


tagggcggcg 

L 


cctacaatcc 


cggcataaat 


I 

cgccgtgacg 


ateageggtc 


cgagcgatcc 


ttgaagctgt 


ccctgatggt 


gcaacgcggg 


catcccgatg 


ccgccggaag 


agcctcgcgt 

i 


i 

eg 




<210> 82 
<211> 10426 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial 
pAN2 67 plasmid * , 


<400> 82 
aacaaaattc 


tccagtcttc 


acatcggttt 


ggga tttttg 


actccgaagt 


aagtcttcaa 


gcaaaacgat 


tcaaaacctc 


tttactgccg 


ttggttctgg 


caggaccggc 


ggctgcgagt 


acagcaccgt 


cgatcaaaag 


eggaaccatt 


tt aaaacaca 


atatgaagga 


tattcatgat 


attaaccaag 


taaaggaagg 


gaatcaagga 


tatcagccga 


categtatea 


aattggcaac 


gaaatgtgtg 


cagccgctga 


agaatatggc 


ca taccacca 


gtgattatgc 


cgcgatttcc 


catggaaaca 


cacaaattaa 


aaactggtct 


cgcgaaatac 


gggcagacat 


ggcctgcccg 


ggtaatggta 


gcgaccggcg 


ctcaggatcg 


aaaggccatc 


cgtcaggatg 


gccttctgct 
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cagcctagcc gggtcctcaa cgacaggagc 9720 

. 1 1 .i .- - 

i ■ 

acgctgccpg agatgegecg cgtgcggctg 9780 

tgccaagggt tggtttgcgc attcacagtt 9840 

ggagtggtga ateegttage gaggtgeege 9900 

tccatgcacc gcgacgcaac gcggggaggc 9960 

atgccaaccc gttccatgtg ctcgccgagg 10020 

cagtgat.cga agttaggctg gtaagagccg 10080 t 

cgtcatctac ctgcctggac ageatggect 10140 

cgagaagaat cataatgggg aaggccatcc 10200 

10212 



Sequence: Recombinant 



gaaaggagga 


ageggaagaa 


cgaagtaaga 


60 


aaaatcaaat 


aaggagtgtc 


aagaatgt 1 1 


1.20 


ttattcgctg 


gatttttatt 


gctgtttcat 


180 


getgaaaegg 


cgaacaaatc 


gaatgagctt 


240 


ettcatgeat 


ggaattggtc 


gttcaatacg 


300 


gcaggatata 


cagccattca 


gacatctccg 


360 


gataaaagca 


t gtcgaactg 


gtactggctg 


420 


cgttacttag 


gtactgaaca 


agaatttaaa 


480 


ataaaggtca 


ttgttgacgc 


ggtcatcaat 


540 


aatgaggtta 


agagtattcc 


aaactggaca 


600 


gatcgaaata 


gtacataatg 


gatttcctta 


660 


gttattatta 


tttttgacac 


cagaccaact 


720 


tctcggtacc 


aagagtttgt 


agaaaegcaa 


780 


taatttgatg 


cctggcagtt 


tatggeggge 


840 
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'gtcctgcccg ccaccctccg ggccgttgpt tcgcaacgtt caaatccgct cccggccjgati 900 

ttgtcctact caggagagcg ttcaccgaca aacaacagat' aaaacgaaag gcccagtctt 960 

tcgactgagc ctttcgtttt atttgatgcc tggcagttcc ctactctcgc atggggagac 1020 

cccacactac catcggcgct acggcgtttc acttctgagt tcggcatggg gtcaggtggg 1080 

' accaccgcgc tactgccgcc aggca'aattc tgttttatca gaccgcttct gcgttctgat 1140 

ttaatctgta tcaggctgaa aatcttctct catccgccaa aacaggatcc atcacgaagc 1200 

- " ' ■ ! ■ '' 

gtcgtatcga aaaaattaat tttgcgcaac ggagaccacc gcttccttct tcttqccttg 1260 

tttcacaaac ggcatcattt cacgaagctt tcttcccact acttcgattt gatgttcgtt 1320 

ctcgcttgca ttgatagcgt tgaaacgagg acggtttact tggttttcga cgatccactc 1380 

ttttgcgaat gtaccgtttt ggatatcttt taatacttct ttcatagatt cttttacttt 1440 

agcgtccaca acgcgagggc ctgs tacgaa atctccccac tytgctgtgt cagagattga 1500 

i 1 

atatctcatt cctgcaagtc cttcttcgta cataaggtct acgattaatt tcagctcatg 1560 

aagacactcg aagtatgcaa gttcaggctg ataacctgct tcagttaagg tttcaaatcc 1620 

ggctttgaca agcgcgctta atccgccgca aagaactgct tgctcaccga acaaatctgt 1680 

ttctgt ttct tctttaaatg tcgtttctaa tacgcccgct cttgcgccgc cgattqcttt 1740 

agcataagcg agggctttgt ctcttgcttc tccagtcaca tcttgataga ttgcgaacaa 1800 

tgcaggtacg ccagctcctt gctcatatgt tcttcttacc aagtgtcccg ggcctttagg 1.860 

ggccactaag aatacatcta catccgccgg aggaacaatt tgatggaaat gcacgttaaa 1920 

tccatgagcg aatactaatg attttcctgc tgtcaattca tctttgattt cagcttcgta 1980 

tactttttgc tgctgctcat ccggaagcag aaccatgatg atttcggctt gggctgccgc 2040 

ttcttttact gaaaatactt tatgtccgtc ttcttgggct tgagtgaaag attttccttg 2100 

tctaacaccg acgatcacgt ctactccgct ttctttaagg ttcagggcat gtgcgtggcc 2160 

ttgcgaaccg tacccgataa ccgctactgt ttttccagcc aatacgttct ctttgatatc 2220 

accgttataa tatactttta ccatttcaat ctctcccttg ttatgtttta tacaatagat 2280 

attgttttat tggatgacgc cttttgctgg ttcccctcgc aaaagccgtt gtacctgttc 2340 

tcgcgatttc tttaatgcca taaggtttta ataactcaat aagcgcttca attttgttag 2400 

attcacctgt cacctgaaca acgatgctgt ctctgctgac atcaacgaca gaggctctaa 2460 

acggttctat gattccatta atctctgttc ttgttgaagg tgcggagaca accttgatta 2520 

aggccagctc cctctggaca atcgattgat ttgtgatgtc tgtgactttc agcacatcaa 2580 
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* p 

tctgtttgtt gagetgtttc gttaactgtt caacatcatt ittcaccttca. acatgaacga 2640 
cgaaggtgat tctggaaacg ccggctgttt ctgtgtgtcc aactgtaatg ctttcaatgt 2700 
tgtaatgcct ttttgtgaat agaccggtga tccggtttaa caccccggag cggttcacca 2760 



cagtcaatgt gataattctt ttcaaggttt cacccccacc atttcatgca gccctttccc 2820 



cggagccacc atcgggaata ctttttcttc gctggcaacc cgcacgtca'a tgacaacagg 2880 

' • | ■ , ■ ■ 

ttctcttgat gttaatgcct cttccagctt ttcctttgct tccgcttctg atgaaattct 2940 

gatgccttta atg'ccgtatg cttcggacaa tttgacgaag tcaggctgag aagcgaattt 3000 

, ■ i 

agattctgaa taacgttctt catagaaaat ttcctgccac tgtctgacca ttccgagaca 3060 

i 

4 1 
j t 

agcgttattt aaaatcacta ccttgaccgg aagatttaat tcgcgaataa catcgagttc 3120 
ttgaagcgtc atttggaatc cgccgtctcc gacaaccgcg acaacagtag catctttttc 3180 



ggccagctgt gcgccgatcg ccgccggaag accgaatccc atcgttccaa jgtccgcctga 3240 



cgtgacccat ttatctgctt tttggaacgg ataaaattgc gctgaccaca tttgatgctg 3300 



gcctacatcc gttgcgacaa tggcctctcc ttttgtaaat tgatgaatat attcaatcaa-' 3360 



tttctgaggt ttaaaacctt cttcttcatt atcracatac cagagcggat actcttcttt 3420 



ccattctgcg agctgttttt tccattcgct tgaatcgctt tgtttgccgt cttgtttgat 3480 



cagctcctgc aggacaattt tgctgtctcc gactacagga atctgtgttt tcatgatttt 3540 



tccgatttca gctggatcaa tatcgatgtg ggctatcttt gcgtttctgg caaagtgttt 3.600 



caggtttcct gtgacacggt catcaaaacg ggcgccgata ctgattaata gatcacattc 3660 



atgaagggcc atattggctg tataagtacc gtgcattccc gccatcccta ggaaaagcgg 3720 



atggtcagcc gggaagcctc cgagccccaa aagggtgtgt gccacaggga tttgctgctg 3780 



ttcagcataa ttttttaatt cttctgacgc ttttccgtgc agtacgcccg cacccgccag 3840 



gatcaccggt tttttcgcac tgctcacggc ttccacaagc ttgcggatct gcaaataatt 3900 



cggctctgtt gtcggctggt atcccgggag attcatctca tgatcgtagc tgaattctcc 3960 



ttcaattgtt gctacatctt tcggaatatc aatcaataca ggtccgggtc ttccagttgt 4020 



tgcaatatgg aacgcttctt taatgatgcg cggcagatct tccggctggc gaacctggta 4080 



gctgtgtttt gttactggca tcgtaatccc taaaatgtct gcttcctgaa atgcatcgct 4140 



cccgattaca gaggttgcta cctgccctgt aaagacgact aacggcaatg aatcaatcat 4200 



ggcatcagca aggcctgtaa caaggtttgt cgctcccggc ctgacgtggc aatgacgaca 4260 



ccggtttccg gagacccttg cgtatccctc cgctgcatga attgctcctt gttcgtgacg 4320 
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ggaaggatat gtaccaacct gaatgtatag cttatcgtaa atcggaagc'a pagccccgcc 4 380 

cggataaccg aagatcattt eta cttt etc tttttttaat gatteaatca gcattaatcg 4440 

tccgctcatc gtctgtgtac attcggcaga tgctgaatcc acctgtacat tagtccccat 4500 

tttatctcct cctctagagc gtcctgctgt , tgttaagatt' attataccac accttgtaga 4560 

1 ii . , ■ ' i 

taaagtcaac aactttttgc aaaatttttc aggaatttta gcagaggttg ttctggatgt 4 620 

i | i ■ . 

agaacaaaac atctttccgc tcttgtgctg ttaggatatc tttcttggaa gctaggtagg 4 680 

i ' 

cctcgagtta tggcagttgg ttaaaaggaa acaaaaagac cgttttcaca caaaacggtc 4740 

tttttcgatt tctttttaca gtcacagcca cttttgcaaa aaccggacag cttcatgcct 4800 

tataactgct gtttcggtcg acgaagcggc cgccgtttaa acgaattcct gcagctggcg 4860 

aatggcgktt ttcgttcgtg aatacatgtt ataataacta taactaataa cgtaacgtga 4920 

ctggcaagag atatttttaa aacaatgaat aggtfctacac ttactttagt tttatggaaa 4 980 

tgaaagatca tatcatatat aatctagaat aaaattaact aaaataatta ttatctagat 5040 

aaaaaattta gaagccaatg aaatctataa ataaactaaa ttaagtttat ttaattaaca 5100 

actatggata taaaataggt actaatcaaa atagtgagga ggatatatt.t gaatacatac 5160 

gaacaaatta ataaagtgaa aaaaatactt cggaaacatt taaaaaataa ccttattggt 5220 

acttacatgt ttggatcagg agttgagagt ggactaaaac caaatagtga tcttgacttt 5-280 

ttagtcgtcg tatctgaacc attgacagat caaagtaaag aaatacttat acaaaaaatt 5340 

agacctattt caaaaaaaat aggagataaa agcaacttac gatatattga attaacaatt 5400 

attattcagc aagaaatggt accgtggaat catcctccca aacaagaatt tatttatgga 54 60 

gaatggttac aagagcttta tgaacaagga tacattcctc agaaggaatt aaattcagat 5520 

ttaaccataa tgctttacca agcaaaacga aaaaataaaa gaatatacgg aaattatgac 5580 

ttagaggaat tactacctga tattccattt tctgatgtga gaagagccat tatggattcg 5640 

tcagaggaat taatagataa ttatcaggat gatgaaacca actctatatt aactttatgc 5700 

cgtatgattt taactatgga cacgggtaaa atcataccaa aagatattgc gggaaatgca 5760 

gtggctgaat cttctccatt agaacatagg gagagaattt tgttagcagt tcgtagttat 5820 

cttggagaga atattgaatg gactaatgaa aatgtaaatt taactataaa ctatttaaat 5880 

aacagattaa aaaaattata aaaaaattga aaaaatggtg gaaacacttt tttcaatttt 5940 

tttgttttat tatttaatat ttgggaaata ttcattctaa ttggtaatca gattttagaa 6000 

aacaataaac ccttgcatag ggggatcgat atccgtttag gctgggcggt gatagcttct 6060 
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cgttcaggca 


gtacgcctct 


tttcttttcc 


agacctgagg 


gaggcggaaa 

i 


tggtgtgagg i 

t 

r i 


6120 


ttcccgggga 


aaagccaaat 


aggcgatcgc 


gggagtgctt 


t 

tatttgaaga 


I 

tcaggct,atc 


6180 


actgcggtca 


atagatttca 


caatgtgatg 


gctggacagc 


ctgaggaact 


ctcgaacccg 


6240 


aatggaaaca 


accagatatt 


tatgaatcag , 


cgcggctcac 


atggcgttgt 


gctggcaaat 


6300 


gcaggttcat 


cctctgtctc 


tatcaatacg 


gcaacaaaat 


tgcctgatgg 

* 


caggtatgac 


6360 


aataaagctg 


gagcgggttc 


atttcaagtg 

I 


i 

aacgatggta 


i 

aactgacagg 


i 

cacgatcaat 


6420 


i 

gccaggtctg 


tagctgtgct 


1 

ttatcctgat 


gatattgcaa 


aagcgcctca 


tgttttcctt 

1 i.* . 


6480 


gagaattaca 


aaacaggtgt 


aacacattct 


ttcaatgatc 


aactgacgat 


taccttgcgt 


6540 


gcagatgcga 


atacaacaaa 


agccgtttat 


caaatcaata 


atggaccaga 


cgacaggcgt 


6600 


ttaaggatgg 


agatcaattc 


acaatcggaa 


aaggagatcc 


aatttggcaa 


aacatacacc 


6660 


atcatgttaa 


aaggaacgaa 


cagtgatggt 

1 t 


gtaacgagga 


ccgagaaat a 


cagttttgtt 


6720 


aaaagagatc 


cagcgtcggc 


t 

caaaaccatc 


ggctatcaaa 


atccgaatca 


ttggagccag 


6780 


gtaaatgctt 


atatctataa 


acatgatggg 


agccgagtaa 


ttgaattgac 


cggatcttgg 


6840 


cct ggaaaac 


caatgactaa 


aaatgcagac 

• 


ggaatttaca 


cgctgacgct 


gcctgcggac 


6900 


acggatacaa 


ccaacgcaaa 


agtgattttt 


aataatggca 


gcgcccaagt 


gcccggtcag 


6960 


aatcagcctg 


gctttgatta 


cgtgctaaat 


ggtttatata 


atgactcggg 


cttaagcggt 


7 020 


tctct tcccc 


attgagggca 


aggctagacg 


ggacttaccg 


aaagaaacca 


tcaatgatgg 


7080 


tttctttttt 


gttcataaat 


cagacaaaac 


ttttctcttg 


caaaagtttg 


tgaagtgttg 


7140 


cacaatataa 


atgtgaaata 


cttcacaaac 


aaaaagacat 


caaagagaaa 


cataccctgc 


7200 


aaggatgctg 


atattgtctg 


catttgcgcc 


ggagcaaacc 


aaaaacctgg 


tgagacacgc 


7260 


cttgaattag 


tagaaaagaa 


cttgaagatt 


ttcaaaggca 


tcgttagtga 


agtcatggcg 


7320 


agcggatttg 


acggcatttt 


cttagtcggg 


cggcacctcg 


ctaacggatt 


caccactcca 


7380 


agaattggag 


ccaatcaatt 


cttgcggaga 


actgtgaatg 


cgcaaaccaa 


cccttggcag 


7440 


aacatatcca 


tcgcgtccgc 


catctccagc 


agccgcacgc 


ggcgcatctc 


gggcagcgtt 


7500 


gggtcctggc 


cacgggtgcg 


catgatcgtg 


ctcctgtcgt 


tgaggacccg 


gctaggctgg 


7560 


cggggttgcc 


ttactggtta 


gcagaatgaa 


tcaccgatac 


gcgagcgaac 


gtgaagcgac 


7620 


tgctgctgca 


aaacgtctgc 


gacctgagca 


acaacatgaa 


tggtcttcgg 


tttccgtgtt 


7680 


tcgtaaagtc 


tggaaacgcg 


gaagtcagcg 


ccctgcacca 


ttatgttccg 


gatctgcatc 


7740 


gcaggatgct 


gctggctacc 


ctgtggaaca 


cctacatctg 


tattaacgaa 


gcgctggcat 


7800 



i 
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tgaccctgag tgatttttct ctggtcccgc cgcatccata ccgccagttg tttaccctcai 7860 

caacgttcca gtaaccgggc atgttcatca^ tcagtaaccc gtatcgtgag catcctctct 7920 

i 

cgtttcatcg gtatcattac ccccatgaac agaaatcccc cttacacgga ggcatcagtg 7980 

accaaacagg aaaaaaccgc ccttaacatg gcccgcttta tcagaagcca gacattaacg 8040 

cttctggaga aactcaacga gctggacgcg gatgaacagg cagacatctg tgaatcgctt 8100 

cacgaccacg ctgatgagct ttaccgcagc tgcctcgcgc gtttcggtga tgacggtgaa 8160 

' ' ! 

aacctctgac acatgcagct cccggagacg gtcacagctt gtctgtaagc ggatqccggg 8220 

IT' 

I 

\ t 

agcagacaag cccgtcaggg cgcgtcagcg ggtgttggcg ggtgtcgggg cgcagccatg 8280 

acccagtcac gtagcgatag cggagtgtat actggcttaa ctatgcggca tcagagcaga 8340 

ttgtactgag agtgcaccat atgcggtgtg aaataccgca cagatgcgta aggagaaaat 8400 

accgcatcag gcgctcttcc gcttcetcgc tcactgactc gctgcgctcg gtcgttcggc 84 60 

i ' 

tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg 8520 

ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg 8580 

ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac 8640 

gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg 8700 

gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct 8-760 

ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg 8820 

tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct 8880 

gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac 8940 

tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt 9000 

tcttgaagtg gtggcctaac tacggctaca ctagaaggac agtatttggt atctgcgctc 9060 

tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca 9120 

ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat 9180 

ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gaaaactcac 9240 

gttaagggat tttggtcatg agattatcaa aaaggatctt cacctagatc cttttaaatt 9300 

aaaaatgaag ttttaaatca atctaaagta tatatgagta aacttggtct gacagttacc 9360 

aatgcttaat cagtgaggca cctatctcag cgatctgtct atttcgttca tccatagttg 9420 

cctgactccc cgtcgtgtag ataactacga tacgggaggg cttaccatct ggccccagtg 9480 

ctgcaatgat accgcgagac ccacgctcac cggctccaga tttatcagca ataaaccagc 9540 
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cagccggaag 


ggccgagcgc .agaagtggt'e 


ctgcaacttt 


,atccgcctcc 


atccagteta 

* ,i 


9600 


ttaattgttg 


ccgggaagct 


agagtaagta 


gttcgccagt 


taatagtttg 

* 


cgcaacgttg 

■I 


9660 


ttgccattgc 


tgcaggcatc 


gtggtgtcac 


gctcgtcgtt 


tggtatggct 


tcattcagct 


9720 


ccggttccca 


acgatcaagg 

j 

tcctccgatc 


cgagttacat 


gatcccccat 


gttgtgcaaa 


aaagcggtta 

i 


9780 


gctccttcgg 


gttgtcagaa 


gtaaigttcjgc 


cgcagtgttia 


tcactcatgg 


9840* 


ttatggcagc 


actgcataat 


\ • i 

tctcttactg 


tcatgccatc 


cgtaagatgc 


ttttctgtga 


9900 


i 

ctggtgagta 


ctcaaccaag 

i 


tcattctgag aatagtgtat 


gcggcgaccg 


agttgctctt 


9960 


gcccggcgtc 


aacacgggat 


aataccgcgc 


i 

cacatagcag 


aactttaaaa 


gtgctcatca 


10020 


ttggaaaacg 


ttcttcgggg 


cgaaaactct 


caaggatctt 


accgctgttg 


agatccagtt 


10080 


cqatqtaacc 


cactcgtgca 


cccaactgat 


cttcagcatc 


ttttactttc 


accagcgttt 


10140 


ctgggtgagc 


aaaaacagga 


aggcaaaatg 


ccgcaaaaaa 


gggaataagg 


gcgacacgga 


10200 


aatgttgaat 


act catactc 

i 


ttcctttttc 


aatattattg 


aagcatttat 


cagggttatt 


10260 


gt ctca tgag 


eggatacata 


tttgaatgt a 


tttagaaaaa 


taaacaaata 


ggggt tccgc 


10320 


gcacatttcc 


ccgaaaagtg 


ccacctgacg 


tct aagaaac 


cattattatc 


atgacattaa 


10380 


cctataaaaa 


taggcgtatc 


acgaggccct 


ttcgtcttca 


agaatt 




10426 



<210> 83 
<211> 4191 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN263 plasmid 



<400> 83 
ccaatacgca 


aaccgcctct 


ccccgcgcgt 


tggccgattc 


attaatgcag 


cccggatccg 


60 


aaagaagagg 


atatcccggt 


tttacagaag 


gcattggatg 


atccaaaggt 


gtccatcaga 


120 


agacaggctg 


ttgtgtactt 


aggaatgatt 


gaaacacctg 


atgttcttcc 


tctattgtat 


180 


aaagcacttg 


aggacaaagc 


tgtatcagtc 


agaagaacgg 


ccggagactg 


cctgtctgat 


240 


atcggcgatc 


ctcaagccat 


tcct gctatg 


atcaagtcat 


taagcgactc 


cagcaagctt 


300 


gttcgctggc 


gtgccgccat 


gttcctgtac 


gaagtcggcg 


atgaaagtgc 


aattgaagct 


360 


ttgcgcgctg 


ccgaagatga 


ccccgaattt 


gaggtcagcc 


ttcaagtcaa 


aatggcgctt 


420 


gaacgtattg 


agcatggaga 


agaagcaaaa 


ggttctgttt 


ggaaacaaat 


gacggaaagc 


480 
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i 

agaaaaaaag gcgaataaag ataaaaaagg tgcagatcat gcaccttttt tatgtgaatt 540 

1 i ' . f 

ggtcgaccga aacagcagtt ataaggcatg aagctgtcfcg gtttttgcaa aagtggctgt 600 
gactgtaaaa agaaatcgaa aaagaccgtt ttgtgtgaaa acggtctttt tgtttccttt 660 

i 

taaccaactg ccataactcg aggcctacct agcttccaag aaagatatcc taacagcaca 720 
agagcggaaa gatgtttjtgt tctacatcca gaacaacctc tgctaaaatt cctgaaaaat 780 

tttgcaaaaa gttgttgact ttatcta'caa ggtgtggtat aataatctta acaacagcag 840 

• p 

gacgctctag aggaggagac accatggcag aattacgcacj taatatgatc acacaaggaa 900 

i .i ■ ■ 

tcgatagagc tccgcaccgc agtttgcttc gtgcagcagg ggtaaaagaa gaggatttcg 960 
gcaagccgtt tattgcggtg tgtaattcat acattgatat cgttcccggt catgttcact 1020 
tgcaggagtt tgggaaaatc gtaaaagaag caatcagaga agcagggggc gttccgtttg 1080 
aatttaatac cattggggta gatgatggca tcgcaatggg gcatatcggt atgagatatt 1140 
cgctgccaag ccgtgaaatt atcgcagact ctgtggaaac ggttgtatcc gcacactggt 1200 
ttgacggaat ggtctgtatt ccgaactgcg acaaaatcac accgggaatg cttatggcgg 1 12 60 
caatgcgcat caacattccg acgatttttg tcagcggcgg accgatggcg gcaggaagaa 1320 
oaagttacgg gcgaaaaatc tccctttcct cagtattcga aggggtaggc gcctaccaag 1380 
cagggaaaat caacgaaaac gagcttcaag aactagagca gttcggatgc ccaacgtgcg 1440 
gqtcttgctc aggcatgttt acggcgaact caatgaactg tctgtcagaa gcacttggtc 1500 
ttgctttgcc gggtaatgga accattctgg caacatctcc ggaacgcaaa gagtttgtga 1560 
gaaaatcggc tgcgcaatta atggaaacga ttcgcaaaga tatcaaaccg cgtgatattg 1620 
ttacagtaaa agcgattgat aacgcgtttg cactcgatat ggcgctcgga ggttctacaa 1680 
ataccgttct tcataccctt gcccttgcaa acgaagccgg cgttgaatac tctttagaac 1740 
gcattaacga agtcgctgag cgcgtgccgc acttggctaa gctggcgcct gcatcggatg 1800 
tgtttattga agatcttcac gaagcgggcg gcgtttcagc ggctctgaat gagctttcga 1860 
agaaagaagg agcgcttcat ttagatgcgc tgactgttac aggaaaaact cttggagaaa 1920 
ccattgccgg acatgaagta aaggattatg acgtcattca cccgctggat caaccattca 1980 
ctgaaaaggg aggccttgct gttttattcg gtaatctagc tccggacggc gctatcatta 2040 
aaacaggcgg cgtacagaat gggattacaa gacacgaagg gccggctgtc gtattcgatt 2100 
ctcaggacga ggcgcttgac ggcattatca accgaaaagt aaaagaaggc gacgttgtca 2160 
tcatcagata cgaagggcca aaaggcggac ctggcatgcc ggaaatgctg gcgccaacat 2220 
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tggaatggga 


CLcgggccaa 


aaguggcact 

i 


t~t f> a /-» /-» « m 

yducdcyyaC 

• i ■ ! , » 


/^r a n 4^ 4— 4— 4— 4— 

ggacg lllll 






/*» 4- /»■ /~f « /■» 4» y-» 


i 

4~ *~\ ^ 4» >— < f*m 

LCddtCygCC 


+ 

acgcaucacc 

1 , 


tgaggccgcc 

) 


i 

gagggcgggc 






uy l ugaaaac 


j"B B^ _*» -"-K >k Bj B) -B^ 

ggagacca t a 


ctaccgttga 


^"4 ^ ' ""^ ■'"I B^ B*^ ^B. 

. uau xgaaaaa 


cgcat cttgg 


^ 4 UU 


«*S V* A* A ^L— 

atgtacaag.t 


gccagaagaa 


gagtgggaaa 


aacgaaaagc 


i 

gaactggaaa 


ggtt ttgaac 


24 60 


cgaaagt gaa 


aaccggctac 


ctggcacgtt 


attctaaact 


tgtgacaagt 


gccaacaccg 


2520 


b*bt jB™* ^« bbl ^^b «Bk b^bi 

gcggt at t at 


g a aaatctag 


Bl Bi .bMb Jta ^^B ,BBB ^bbi 

acccctggcg 


taatagcgaa 


gaggcccgca 


ccgatcgccc 


O C O A 

2580 


t tcccaacag 


ttgcgcagcc 


tgaatggcga 


at gagcttgc 


gccgtcccgt 


caagtcagcg 


2640 


taatgctctg 


ccagtgttac 


i 

aaccaattaa 

j 


ccaattctga 


ttagaaaaac 


tcatcgagca 


2700 


t caaatgaaa 


ctgcaattta 


i 

ttcatatcag 


gat tatcaat 


accatatttt 


tgaaaaagcc 


B*fc *4 ,B*> B^ 

2760 


J I X 1 ft 

gtttctgtaa 


tgaaggagaa 


aactcaccga 


ggcagttcca 


taggatggca 


agatcctggt 


2820 


atcggtctgc 


gattccgact 


cgtccaacat 


caatacaacc 


tat taatttc 


ccctcgtcaa 


2880 


aaataaggtt 


atcaagtgag 


i , 

aaatcaccat 


gagtgacgac 


tgaatccggt 


gagaatgcjca 


2940 


aaaggtt atg 


catttctttc 


cagacttgtt 


caacaggcca 


gccattacgc 


tcgtcatcaa- 


3000 


aatcactcgc 


atcaaccaaa 


ccgttattca 


ttcgtgattg 


cgcctgagcg 


agacgaaata 


3060 


cgcgatcgct 


gttaaaagga 


caattacaaa 


caggaatcga 


atgcaaccgg 


cgcaggaaca 


3120 


ctgccagcgc 


at caacaata 


ttttcacctg 


aatcaggata 


t tct tctaat 


acctggaatg 


3180 


ctgtt tt ccc 


agggatcgca 


gtggtgag ta 


_4_ ^ 

accatgcatc 


a tcaggagta 


cggataaaat 


3240 


gcttgatggt 


cggaagaggc 


ataaattccg 


tcagccagtt 


tagtctgacc 


atctcatctg 


j'300 


taacatcatt 


ggcaacgcta 


cctttgccat 


gtttcagaaa 


caactctggc 


gcatcgggct 


3360 ' 


t ccca tacaa 


t caat aga tt 


gtcgcacctg 


at t gcccgac 


attat cgcga 


gcccatttat 


34 20 


acccatataa 


at cagca t cc 


at gt tggaat 


t t aatcgcgg 


cctcgacgag 


caagacgttt 


34 oU 


CCCy l Lyddt 


>"Bj -b^ a— l ^B- bb» 

a uggctcata 


acacccct tg 


t a u i. a c u g tl h 


rat gt aagca 


gacagtttta 


304 U 


4" 4* /-f 4" 4- /-* q 4- o 


f-naf 3 f -v 4- 4- 4- 
CydLaLaLt L 


LtdtCtLgty 


Caacy taaca 


ccagagatuu 


tgagacactc 


Otf AA 
JOUU 


ycic_-ciciycii_.yci 


4" /■* 4" 4" /™*4- 4* o /r 


a 4^ A> /"r 4-4—4-4- y-* 

dLCy LLLLyy 


4* /™* 4" /■* r*t 4" a 

tctycgcyLd 


O 4" 4" n 4* 4— 4— 

atctCLtgci. 


CtyddodCga 




aaaaaccgcc 


ttgcagggcg 


gtttttcgaa 


ggttctctga 


gctaccaact 


ctttgaaccg 


3720 


aggtaactgg 


cttggaggag 


cgcagtcacc 


aaaacttgtc 


ctttcagttt 


agccttaacc 


3780 


ggcgcatgac 


ttcaagacta 


actcctctaa 


atcaattacc 


agtggctgct 


gccagtggtg 


3840 


cttttgcatg 


tctttccggg 


ttggactcaa 


gacgatagtt 


accggataag 


gcgcagcggt 


3900 


cggactgaac 


ggggggttcg 


tgcatacagt 


ccagcttgga 


gcgaactgcc 


tacccggaac 


3960 
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tgagtgtcag gcgtggaatg agacaaacgc ggccataaca gcggaatfgac accggtaaac 4020 

' i ' 1 

cgaaaggcag gaacaggaga gcgcacgagg gagccgccag gggaaacgcc tggtatcttt 4080 

atagtcctgt cgggtttcgc caccactgat ttgagcgtca gatttcgtga tgcttgtcag 4140 

999ggcg9a9 cctatggaaa aacggctttg ccgcggccct etcacttccc t 4191 

i 1 ' ' 1 

ii , i 

<210> 84 , '. 

<211> 702 1 i 

<212> DNA ' ' . 

<213> 'Bacillus subtilis 

<220> ' 
<221> CDS , ( 

<222> (1) . . (699) 

<400> 84 

ttg tta 1 ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 48 

Met Leu Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
i 5 10 15 

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca age agg 96 
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 

20 25 30 

cat aaa aca gaa gat gag ttt ggg atg att ttg cge tec tta ttt gat * 144 
His Lys Thr Glu Asp Glu Phe Gly Met lie Leu Arg Ser Leu Phe Asp 
35 40 45 

cac tec ggg ctt atg ttt gaa cag ata gat ggc att att att teg tea 192 
His Ser Gly Leu Met Phe Glu Gin lie Asp Gly He He He Ser Ser 
50 55 60 

gta gtg ccg cca ate atg ttt gcg tta gaa aga atg tgc aca aaa tac 240 
Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

ttt cat ate gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 288 
Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 

85 90 95 

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga ate gta 336 
Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 

100 105 110 

aat get gtc get gcg ata cac ttg tac ggc aat cca tta att gtt gtc 384 
Asn Ala Val Ala Ala lie His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

gat ttc gga acc gec aca acg tac tgc tat att gat gaa aac aaa caa 432 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 

tac atg ggc ggg gcg att gee cct ggg att aca att teg aca gag gcg 480 
Tyr Met Gly Gly Ala He Ala Pro Gly He Thr lie Ser Thr Glu Ala 
145 150 155 160 
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ctt tac teg cgt gca gca aag ctt. cct cgt -ate gaa ate acc cgg'ccc 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg lie Glu I'le'Thr Arg Pro 

165 170 ' , ' 175 



528 



gac aat 
Asp Asn 



att ate gga aaa aac act gtt age gcg atg caa tct 
lie lie Gly Lys Asn' Thr Val Ser Ala Met Gin Ser 
180 185 ' 190 



gga att 
Gly lie 



576 



tta ttt 
Leu Phe 



ggc 
Gly 
195 



tat 
Tyr 



gtc 
Val 



ggc 
Gly 



caa 
Gin 



gtg gaa gga ate gtt aag cga 
Val Glu Gly lie Val Lys ' Arg 
200 205 



atg aaa 
Met Lys' 



tgg cag gca aaa cag gac cca; agg tea ttg ega cag gag gee tgg cgc 
Trp Gin Ala Lys Gin Asp Pro; Arg Ser Leu Arg Gin Glu Ala Trp lArg 
210 215! 220 



624 



672 



cgc tea ttg cga acg aat cag att gta tag 
Arg Ser Leu Arg Thr Asn Glni lie Val 
225 230 , 



702 



<210> 85 
<211> 233 
<212> PRT 

<213> Bacillus subtilis 



<400> 85 

Met Leu Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly v a i 
1 5. 10 15 

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg lie Glu Thr Ser Arg 

20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met lie Leu Arg Ser Leu Phe Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin lie Asp Gly lie lie He Ser Ser 
50 55 60 

Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 

85 90 95 

Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 

100 105 110 

Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 



Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 
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Leu Tyr Ser Arg Ala Ala Lys Leu Pro. Arg lie Glu lie Thr Arg Pro 

165 170' ■ • " ' ' 



175' 



Asp Asn lie lie Gly Lys Asn Thr . Val Ser Ala' Me,t Gin Ser Gly , lie 

180 185 190 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met. Lys 

195 : 200 ' i 205 

i 

' Trp Gin Ala Lys Gin Asp Prcj Arg Ser Leu Arg Gin Glu Ala Trp Arg 

210 , 215 • 220 1 . ' 



Arg Ser Leu Arg Thr Asn Gin He Val 

225 230 . i 

i ( 

<210> 86 l 
<211> 1623 
<212> DNA 

<213> Bacillus subtilis 



i ■ 



<220> 

<221> CDS 

<222> (1) . . (1620) 



<400> 86 

atg tat ttg gca ttc cag gtg caa aaa ttg atg egg tat ttg acg ctt 
Met Tyr Leu Ala Phe Gin Val Gin Lys Leu Met Arg Tyr Leu Thr Leu 
15 10 15 



48 



tac aag ata aag gac ctg aaa tta teg ttg ccc ggc acg aac aaa acg 
Tyr Lys He Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr 

20 . 25 . 30 



96 



cag caa ttc atg gec caa gca gtc ggc cgt tta act gga aaa ccg gga 
Gin Gin Phe Met Ala Gin Ala Val Gly Arg Leu Thr Gly Lys Pro Gly 
35 40 45 



144 



gtc gtg tta gtc aca tea gga ccg ggt gee tct aac ttg gca aca ggc 
Val Val Leu Val Thr Ser Gly Pro Gly Ala Ser Asn Leu Ala Thr Gly 
50 55 60 



192 



ctg ctg aca gcg aac act gaa gga gac cct gtc gtt gcg ctt get gga 
Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly 
65 70 75 80 



240 



aac gtg ate cgt gca tat cgt tta aaa egg aca cat caa tct ttg gat 
Asn Val He Arg Ala Tyr Arg Leu Lys Arg Thr His Gin Ser Leu Asp 

85 90 95 



288 



aat gcg gcg eta ttc cag ccg att aca aaa tac agt gta gaa gtt caa 
Asn Ala Ala Leu Phe Gin Pro He Thr Lys Tyr Ser Val Glu Val Gin 

100 105 110 



336 



gat gta aaa aat ata ccg gaa get gtt aca aat gca ttt agg ata gcg 
Asp Val Lys Asn He Pro Glu Ala Val Thr Asn Ala Phe Arg He Ala 
115 120 125 



384 



tea gca ggg cag get ggg gee get ttt gtg age ttt ccg caa gat gtt 



432 
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Ser Ala Gly Gin Ala Gly Ala Ala Phe Val Ser Phe Pro Gin Asp Val 

130 ' i 135 ' ' ; ' '140 

' ' ' '"■ ' ■ ' ' 

gtg aat gaa gtc aca aat acg aaa aac gtg cgt get ,'gt't gca ' gcg cca 480 

Val Asn Glu Val, Thr Asn Thr Lys Asn Val Arg Ala Val Ala Ala Pro 

145 150 155 160 

i 

aaa etc ggt cct ejea gca gat gat gca ate agt gcg gec ata gca aaa 528 

Lys Leu Gly Pro Ala Ala Asp Asp Ala lie: Set Ala Ala Tie Ala Lys 

165 ,170 . 175 

' 1 '''''' 

ate caa' aca gca aaa ctt cct gtc gtt ,ttg gtc ggc atg aaa ggc gga 576 

lie Gin Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly 

. 180 185 190 

aga ccg gaa gca att aaa gcg gtt cgc aag ctt ttg aaa aag gtt cag 624 

Arg Pro Glu Ala lie Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gin 
195 200 , 205. 

ctt cca ttt gtt gaa aca tat caa get gee ggt acc 1 ctt tct aga gat 672 

Leu Pro Phe Val Glu Thr Tyr Gin Ala Ala Gly Thr Leu Ser Arg Asp 
210 . 215 220 

tta gag gat caa tat ttt ggc cgt ate ggt ttg ttc cgc aac cag cct 720 

Leu Glu Asp Gin Tyr Phe Gly Arg lie Gly Leu Phe Arg Asn Gin Pro 

225 230 235 240 

ggc gat tta ctg eta gag cag gca gat gtt gtt ctg acg ate ggc tat 768 

Gly Asp Leu Leu Leu Glu Gin Ala Asp Val Val Leu Thr lie Gly Tyr 

245 250 255 

gac ccg att gaa tat gat ccg aaa ttc tgg aat ate aat gga gac egg 816 

Asp Pro He Glu Tyr Asp Pro Lys Phe Trp Asn He Asn Gly Asp Arg 

260 265 270 

aca att ate cat tta gac gag att ate get gac att gat cat get tac 864 

Thr He He His Leu Asp Glu He He Ala Asp He Asp His Ala Tyr 
275 280 285 

cag cct gat ctt gaa ttg ate ggt gac att ccg tec acg ate aat cat 912 

Gin Pro Asp Leu Glu Leu lie Gly Asp He Pro Ser Thr He Asn His 
290 295 300 

ate gaa cac gat get gtg aaa gtg gaa ttt gca gag cgt gag cag aaa 960 

lie Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gin Lys 

305 310 315 320 

ate ctt tct gat tta aaa caa tat atg cat gaa ggt gag cag gtg cct 1008 

He Leu Ser Asp Leu Lys Gin Tyr Met His Glu Gly Glu Gin Val. Pro 

325 330 335 

gca gat tgg aaa tea gac aga gcg cac cct ctt gaa ate gtt aaa gag 1056 

Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu lie Val Lys Glu 

340 345 350 

ttg cgt aat gca gtc gat gat cat gtt aca gta act tgc gat ate ggt 1104 

Leu Arg Asn Ala Val Asp Asp His Val Thr Val Thr Cys Asp He Gly 
355 360 365 
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teg cac tec att tgg atgtca cgt tat ttc cgc age tap gag, ccg tta 1152 

Ser His Ser lie' Trp Met Ser Arg Tyr Phe A!rg Ser Tyr Glu Pro Leu 

370 ' 375 - • 380 ' ' . ' 

i ■ 

aca tta atg ate agt aac ggt atg caa aca etc ggc gtt gcg ctt cct 1200 
Thr Leu Met lie Ser Asn Gly Met Gin Thr Leu Gly Val Ala Leu Pro 

385 , 390 , ' 395 , ' 400 

ii . ■ i 

tgg gca ate ggc get tea ttg gtg aaa. ccg gga gaa aaa 'gtg gtt tct 1248 

Trp Ala lie Gly Ala Ser Leu Val ' Lys Pro Gly Glu Lys Val Val Ser 

' ' 405 ' >410 415 

i ■■ ■ 

gtc t.ct ggt gac ggc ggt ttc tta ttc tea gca atg gaa tta gag aca 1296 

Val Ser Gly Asp Gly Gly Phe Leu Phe Ser, Ala Met Glu Leu Glu Thr' 

420 425 430 

t ' 

gca gtt cga eta aaa gca cca att gta cac att gta tgg aac gac age 1344 
Ala Val Arg Leu Lys Ala Pro lie Val His He Val Trp Asn Asp Ser 
1 435 ' 440 445 

aca tat gac atg gtg cat ttc cag caa ttg aaa aaa tat aac cgt aca 1392 
Thr Tyr Asp Met Val His Phe Gin Gin Leu Lys Lys Tyr Asn Arg Thr 
450 455 460 

tct gcg gtc gat ttc gga aat ate gat ate gtg aaa tat gcg gaa age 1440 
Ser Ala Val Asp Phe Gly, Asn He Asp He Val Lys Tyr Ala Glu Ser 
465 470 475 480 

ttc gga gca act gcg ttg cgc gta gaa tea cca gac cag ctg gca gat 1488 
Phe Gly Ala Thr Ala Leu Arg Val Glu Ser Pro Asp Gin Leu Ala Asp 

485 490 495 

gtt ctg cgt caa ggc atg aac get gaa ggt cct gtc ate ate gat gtc 1536 
Val Leu Arg Gin Gly Met Asn Ala Glu Gly Pro Val He He Asp Val 

500 505 510 

ccg gtt gac tac agt gat aac att aat tta gca agt gac aag ctt ccg 1584 
Pro Val Asp Tyr Ser Asp Asn He Asn Leu Ala Ser Asp Lys Leu Pro 
515 520 525 

aaa gaa ttc ggg gaa etc atg aaa acg aaa get etc tag 1623 
Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu 
530 535 540 



<210> 87 
<211> 540 
<212> PRT 

<213> Bacillus subtilis 
<400> 87 

Met Tyr Leu Ala Phe Gin Val Gin Lys Leu Met Arg Tyr Leu Thr Leu 
15 10 15 

Tyr Lys He Lys Asp Leu Lys Leu Ser Leu Pro Gly Thr Asn Lys Thr 

20 25 30 
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< ■ 

Gin Gin Phe Met Ala Gin Ala Val Gly Arg Leu Thr Gly Lys Pro Gly 

35 40 i , 1 45 ■ • , 

Val Val Leu Val Thr Ser Gly Pro Gly Ala 'Ser Asn Leu Ala Thr Gly 

.50 ( ' 55 ■ , 60 

Leu Leu Thr Ala Asn Thr Glu Gly Asp Pro Val Val Ala Leu Ala Gly 

65 70 , 1 75 ,80 



■ • i 



■. Asn Val lie Arg Ala Tyr Arg 

85 



Leu Lys Arg Thr His Gin Ser Leu ,Asp 
90 . 95 ' ' 



Asn Ala Ala Leu Phe Gin Pro! lie Thr Lys Tyr Ser Val Glu Val Gin 

100 105 110 

! ' i I- • 

Asp Val Lys Asn lie Pro Glu! Ala Val Thr Asn Ala Phe Arg lie Ala 
115 120 125 

i 

f 

Ser Ala Gly Gin Ala Gly Ala^ Ala Phe Val Ser Phe Pro Gin Asp Val 
130 .135 140 



Val Asn Glu Val Thr Asn Thr! Lys Asn Val Arg Ala Val Ala Ala Pro 



145 



150 i 



155 



160 



Lys Leu Gly Pro Ala Ala Asp Asp Ala lie Ser Ala Ala lie Ala Lys 

165 170 175 

lie Gin Thr Ala Lys Leu Pro Val Val Leu Val Gly Met Lys Gly Gly 

180 185 190 

Arg Pro Glu Ala lie Lys Ala Val Arg Lys Leu Leu Lys Lys Val Gin 
195 200 205 

Leu Pro Phe Val Glu Thr Tyr Gin Ala Ala Gly Thr Leu Ser Arg Asp 
210 215 . 220 

Leu Glu Asp Gin Tyr Phe Gly Arg lie Gly Leu Phe Arg Asn Gin Pro 
225 230 235 240 

Gly Asp Leu Leu Leu Glu Gin Ala Asp Val Val Leu Thr lie Gly Tyr 

245 250 255 

Asp Pro lie Glu Tyr Asp Pro Lys Phe Trp Asn lie Asn Gly Asp Arg 

260 265 270 

Thr lie lie His Leu Asp Glu lie lie Ala Asp lie Asp His Ala Tyr 
275 280 285 

Gin Pro Asp Leu Glu Leu lie Gly Asp lie Pro Ser Thr lie Asn His 
290 295 300 

lie Glu His Asp Ala Val Lys Val Glu Phe Ala Glu Arg Glu Gin Lys 
305 310 315 320 

lie Leu Ser Asp Leu Lys Gin Tyr Met His Glu Gly Glu Gin Val Pro 

325 330 335 



Ala Asp Trp Lys Ser Asp Arg Ala His Pro Leu Glu lie Val Lys Glu 
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340 



345 



350 



Leu Arg Asn Ala Val Asp Asp His Val Thr ( Val Thr' Cys Asp lie <Gl'y 
355 * 360 > 1 365 . ' 

Ser His Ser lie Trp Met Ser Arg Tyr Phe Arg Ser Tyr Glu Pro Leu 
370 • 375' ( 380 

, ■ 1 1 

1 

Thr Leu Met lie Ser Asn Gly Met Gin Thr Leu Gly Val, Ala Leu Pro 



385 



390 



395 



4 00 



Trp Ala lie Gly Ala Ser Leu, Val Lys Pro Gly Glu Lys Val Val Ser 
' 405 | 410 415 

Val Ser Gly Asp Gly Gly Phe- Leu Phe Ser Ala Met Glu Leu Glu '^hr 

420 ! 425 430 

Ala Val Arg Leu Lys Ala Pro lie Val His lie Val Trp Asn Asp Ser 
435 / 440 445 

1 

Thr Tyr Asp Met Val His Phe, Gin Gin Leu Lys Lys Tyr Asn Arg Thr 

450 455! , 460 

i 

1 

1 l 

Ser Ala, Val Asp Phe Gly Asn lie Asp lie Val Lys Tyr Ala Glu Ser 
465 470 475 480 

Phe Gly Ala Thr Ala Leu Arg Val Glu Ser Pro Asp Gin Leu Ala Asp 

485 490 495 

Val Leu Arg Gin Gly Met Asn Ala Glu Gly Pro Val lie lie Asp Val 

500 505 510 

Pro Val Asp Tyr Ser Asp Asn lie Asn Leu Ala Ser Asp Lys Leu Pro 
515 520 525 



Lys Glu Phe Gly Glu Leu Met Lys Thr Lys Ala Leu 
530 535 540 



<210> 88 
<211> 23 
<212> DNA 

<213> Artificial Sequence' 
<220> 

<223> Description of Artificial Sequence: ribosome 
binding site 

<220> 

* 

<223> All occurrences of n indicate any nucleotide 
<400> 88 

agaaaggagg tgannnnnnn atg 23 



<210> 89 
<211> 7 
<212> PRT 
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<213> Artificial Sequence 



<220> • ' ' • "■ i ( 

<223> Description of Artificial Sequence: PanC 
C terminus ' ' 

<400> 89 

lie Arg Glu Met Glu Arg lie < ' 

1 1 5 ' ' 



<210> 90 1 
<211> 6 ( 
<212> PRT 

<213> Artificial Sequence 



i 



<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

<400> 90 

lie Arg ,Glu Arg Arg 
1 5 



<210> 91 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PanC 
C terminus 

<400> 91 

lie Arg Arg Lys Glu Val Asn 
1 5 

<210> 92 
<211> 6688 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN336 plasmid 

<400> 92 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgcacc aggcttctca 240 
ggcgctgact tagaaaacct cttgaatgaa gctgcgcttg tagcggctcg tcaaaacaag 300 



icrvu^irv /wn ni o 1-7-70 a o i 
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cacaccgtta tcggtctcgt tttagatgag gcagatatgg ttcataaagt aacgattgtt 480 



gatcaaatcg cttacgaaat tgatcaggaa attcagcgca tcatcaaaga atgttatgag 840 



cctgagcgta atttctcaga tgatgaaaag aacgatgatg tgaaagtaaa cattctgaca 1020 
aaaacagaag aaaagaaaga cgatacgaaa gagtaattcg ctttctttct aaaaaaactg 1080 
ccggctgacg ctggcagttt ttttatgtaa atgattggct cagctgcggc ttttacaatc 1140 
atccaattct ggtatcgatt tgtttacaaa tgagccgctg atcgtgtatg gtattgtaga 12C0 
atgtttgtaa aaagtaaagt agagaaacta ttcaaaagtg gtgatagagg ttgttactgg. 1260 
ttatcgatgt ggggaacacc ctgcagctcg agtgaaatac cgcacagatg egtaaggaga 1320 
aaataccgca tcaggcgata aacccagcga accatttgag gtgataggta agattatacc* 1380 
gaggtatgaa aacgagaatt ggacctttac agaattactc tatgaagcgc catatttaaa 1440 
aagctaccaa gacgaagagg atgaagagga tgaggaggca gattgccttg aatatattga 1500 
caatactgat aagataatat atcttttata tagaagatat cgccgtatgt aaggatttca 1560 
gggggcaagg cataggcagc gcgcttatca atatatctat agaatgggca aagcataaaa 1620 
acttgcatgg actaatgctt gaaacccagg acaataacct tatagcttgt aaattctatc 1680 
ataattgtgg tttcaaaatc ggctccgtcg atactatgtt atacgccaac tttca^aaaca 1740 
actttgaaaa agctgttttc tggtatttaa ggttttagaa tgcaaggaac agtgaattgg 1800 
agttcgtctt gttataatta gcttcttggg gtatctttaa atactgtaga aaagaggaag 1860 
gaaataataa atggctaaaa tgagaatatc accggaattg aaaaaactga tcgaaaaata 1920 
ccgctgcgta aaagatacgg aaggaatgtc tcctgctaag gtatataagc tggtgggaga 1980 
aaatgaaaac ctatatttaa aaatgacgga cagccggtat aaagggacca cctatgatgt 2040 



cggacccgct 

• , • . • . -*• 


360 

i 


i 

cgaaggc'gga 


420 


aacgattgtt 


480 


ttatttccaa 


540 


tgctgaagag 

1 


600 


1 

1 

gacgaatatt 


660 


acaatt t aaa 


720 


aaacta'caat 

^4 ^» «M *M 

1 


780 


a t atta tcraa 


840 


ccaaacactt 


900 


t aaaacat t a 


960 


cattctgaca 


1020 


aaaaaaactcr 


1080 


^tttacaa^c 

^ \_v W ^» U . J ^ 


1140 


atattfitaaa 


12C0 

mh. J~ \S Nn/ 


ttattactaa 


1260 


cat aaaaaoa 


1320 


aaat tat - ace* 


1380 


catatttaaa 


1440 


aatatattga 


1500 


aaggatttca 


1560 


aagcataaaa 


1620 


aaattctatc 


1680 


tttc^aaaca 


1740 


agtgaattgg 


1800 


aaagaggaag 


1860 


tcgaaaaata 


1920 


tggtgggaga 


i980 


cctatgatgt 


2040 
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ggaacgggaa 


aaggacatga 

t 


1 ^k. _T*l_| _n_| 4* 

tgctatggct 

t ■ • • 


gcactttgaa 


' i 

cggcatgatg 


gctggagcaa 


ttgctcggaa 


gagtat gaag 


i 

atgaacaaag 


gtgcatcagg 


ctct t t cact 

i 


ccatcgacat 


cagccgctta 


1 L 1 

gccgaattgg 


attacttact 


aaactgggaa 

' 'i < 


gaagacactc 


catttaaaga 


ggaaaagccc 


i 

gaagaggaac 


ttgtcttttc 


tgtgaaagat 


ggcaaagtaa 


gtggctttat 


gtggtatgac 


attgccttct 


gcgtccggt.c 


tgtcgagcta 


i 

l 

ttttttgact 


tactggggat 


• i 

tattttactg 


gatgaattgt 


tttagtacct 


aagcttttta 


gacatctaat 


cttttctgaa 


tttatat ctt 


ttctaaaagt 


t.cgc'tagata 


tcgccattcg 


ccat tcaggc 


tgcgcaactg 


caggcaaaac 


aggacccaag 


gtcatt gcga 


aatcagattg 


tatagacatc 


gttgat ccat 


atgaaagaaa 


ccgcgtagga 


agtgtatagg 


act t cjcg t at 


gacggaaaag 


ttcgggctta 


ggggcagaga 


cgccatggta 


cgtggccgac 


agcttcactt 


atgctcggcg 


ctatgctgaa 


_ a. 

gggcggaggt 


ccgatcggag 


ctattgtagc 


ctatgt ct ct 


aacccgcaag 


ttcatt ttga 


acgtgcggtt 


ggaacaaacg 


gaacgttaag 


cttcacagga 


caagt agaaa 


tcgt t tcagg 


tgtgtcatct 


gagcaggttc 


cttcatcagt 


t accatt ct t 


gcggcagggg 


gctttattat 


aatcacaaaa 


attgaacagc 


gtctatctca 


agggct gaca 


ccagaagaaa 


ttttagaaga 


aacgatgcct 


gtcagattcc 


attgcccttg 
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rtacrt* ait* t* 

i-.- . ,;. 


• 


'2100 




ant - C\ r\ c\ c\ c c c\. 

ct y u^quu w v y 


CI ••MMvU ^ l> 

1 




t 


\~ \~ pi Y c ri pi et c* 


'1 

Ld V^^O 


2220 


t i 


LaLo\«%JCl 


a i. ay u l u dy a 


22ft0 

^ U \J 

t 


(jaatddLUol 

i 1 • 


r- 4" /-r /~i r~« a /"t 


u y y a l. l. y v^y ct 


2*^4 O 


i ■ , 

uccgcgcgag 


1 

f*^ 4>* Q 4* *m ~s 4™ 4^ 

Cuy UatgaLL 


4™ 4— 4- ^ ^> « 

■ UlttaaayaC 


t 


ccacggcgac 

t 


cugggagaca 


^m— ^mm^ A. a a 4~ 4* 

gcaacdLCit 




t gatct tggg 


agaageggea 


gggcggacaa 


o c: o n 


gatcagggag 


ga t at egggg 


aagaacagta 




caagcctgat 


tgggagaaaa 

•i 


taaaatattia 




agatttagat 


gtctaaaaag 


ctttaactac 


O T 

2700 


gtacatccgc 


aactgtccat 


i 

actctgatgt 


^ o f\ 
21 SO 


ggggtcccga 


gcgcctacga 


ggaatt tgta 


, 2820 


ttgggaaggg 


egateggtzge 


ggt cgactgg* 


o O O 1^ 


caggaggcct 


ggcgccgct c 


att gcgaacg 


^94 0 


tcttaaccct 


aaaagggct g 


gaavtgar.tt* 


Jt'UU 


aggtt tagta 


dtggattatt 


tagtaaaagc 


oUbU 


t gcagcgaga 


a eg act gat a 


t ggt a a a tiga 




agcatccgcc 


gcactaggcc 


gcacaatgac 


•31 on 
jloU 


gggcga tgat 


aagctgaccg 


tgaaaatcga 






yL«\_clclcigycig 


dciy LUdy dy l. 




(_ U L add L. y a a 


i— ci a. y y l day l> 


l. ty a y uvoy 


3360 

•J J U w 


tgucguaaaa 


^ 4- 4- 4— ^ ^ 

LjaLLLdyyLL 


4— f% n j~r -j f-y 4- 4— 

uyL^y^yciy lc 


3d 2H 


d U o d i- Lduu d 


ya Luat l. t»L.ci 


V^t C C* \* LCI L d 


34 80 


gggcgtaggt 


gtgctcgtaa 


atcctgacaa 


3540 


tcagctgatg 


ccgggaacag 


atgatgaaac 


3600 


agtagagccg 


atttctaagc 


tcatccaaaa 


3660 


agtcctaggc 


gagaaacctg 


agattttgga 


3720 


ttcaaaagaa 


cggttcgaaa 


cagecatttt 


3780 
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aggactaggc 


aaaaaaqaaa 

■ i 


atgccatttt 


* 

tgtaatgaaa 

1 | 1 


ccaaactacc 


cgctaagctc 


qtttgaaatc 


aagaacaatc 


acctaaatag 


cttggcgtaa 


tcacaattcc 


acacaacata 


i 

aaataaacta 


r 1 

actcacatta 


t.gtcgtgcca 


qctqcattaa 


aacactcttc 


1 

cgct t cctcg 


coatatcaac 


tcactcaaao 

1 


aaaaaaacat 

\a<% 

1 


ataaacaaaa 


t GQCCtt 1 1 1 1 


caataaactc 


aciaacji" aaca 


aaacccgaca 


tccitacoctc 


tcctottc^a 


caaaaaacat 

~j 


aacactt tct 


1 1" ccictccaa 


actaaactat 

m ^3 


cccrat aact a 


t cgt ct t gag 


ccactaataa 


Caacattaoc 


ggtggcct aa 


ct acggctac 


cagtt acct t 


cggaaaaaga 


gcggtggttt 


ttttgtttgc 


atcctttgat 


cttttctacg 


ttttggtcat 


gagat tatca 


gttttaaatc 


aatctaaagt 


tcagtgaggc 


acctatct ca 


ccgtcgtgta 


gat aactacg 


taccgcgaga 


cccacgctca 


gggccgagcg 


cagaagtggt 


gccgggaagc 


t agagtaagt 
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ttcaagatat gat,agaagaa 
agtacttatt tacaaaagaa 
tttagcgggt ttttaatttg 

tggacgcgtt ggatgcatag 

1 i 

i 

tcatggtcat agctgtttcc 
cgagccggaa 1 gcataaagtg 
attgcgttgc gctcactigcc 

. i 

i 

tgaatcggcc aacg'cgcggg 

i 

' i 

ctcactgact cgctgcgctc 
gcggtaatac ggttatccac 
ggccagcaaa aggccaggaa 

r t 

cgcccccctg acgagcatca 
ggactataaa gataccaggc 
accctgccgc ttaccggata 
catagctcac gctgtaggta 
gtgcacgaac cccccgttca 
tccaacccqg taagacacga 
agagcgaggt atgtaggcgg 
actagaagga cagtatttgg 
gttggtagct cttgatccgg 
aagcagcaga ttacgcgcag 
gggtctgacg ctcagtgga.a 
aaaaggatct tcacctagat 
atatatgagt aaacttggtc 
gcgatctgtc tatttcgttc 
atacgggagg gcttaccatc 
ccggctccag atttatcagc 
cctgcaactt tatccgcctc 
agttcgccag ttaatagttt 



PCTYUS00/25993 

gatggacaag ccgaagcagt 3840 

'i 

gagctggaag cigcttcgtga 3900 
agaaaagggg ctgaaagcag 3960 
cttgagtatt ctatagtgtc 4020 
tgtgtgaaat tgttatccgc 4080 
taaagcctgg ggtgcctaat 4140 
cgctttccag tcgggaaacc 4200 
gagaggcggt ttgcgtattg 4260 
ggtcgttcgg ctgcggcgag 4 320 
agaatcaggg gataacgcag 4380 
ccgtaaaaag gccgcgttgc 44 40 
caaaaatcga cgctcaagtc 4500 
gtttccccct ggaagctccc 1 4 560 
cctgtccgcc tttctcccttv 4 620 
tctcagttcg gtgtaggtcg 4 680 
gcccgaccgc tgcgcct tatv * 4 7 4 0 
cttatogcca ctggcagcag 4 800 
tgctacagag ttcttgaagtV 4860 
tatctgcgct ctgctgaagc 4920 
caaacaaacc accgctggta 4980 
aaaaaaagga tctcaagaag 5040 
cgaaaactca cgttaaggga 5100 
ccttttaaat taaaaatgaa 5160 
tgacagttac caatgcttaa 5220 
atccatagtt gcctgactcc 5280 
tggccccagt gctgcaatga 5340 
aataaaccag ccagccggaa 5400 
catccagtct attaattgtt 5460 
gcgcaacgtt gttggcattg 5520 
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ctacaaocat 


CQtaatat ca 


cact cat cat 


ttoatataac 

W \* fc* 

1 


ttcattcaac 

* 1 


tccaattccc 

r , 


5580 

\# 

1 • 


aacoatcaaa 


acaaottaca 


taatccccca 


tattatacaa 

i 


aaaadcoat t 

1 

i 


1 

aactcctt ca 

1 1 


5640 


ntrrlrrfiat 

y v» l> ^— y c* ^ 




u y ci y v_ i_ y y 


rrnraotatt 

v*» y v»r cj y w y h> v» 


■ atcactcata 


1 

attataacaa 


5700 


cactacataa 


ttctcttact 


at cat a a cat 

N-j W ^ U ^ 


ccataaoata 

( 


cttttctata 


actootaaat 


5760 




y L. ci ^ ^ s»* y c* 


1 

ycici ^ct^^y^y 


^UV^U U ^ y Cl 


1 

oaottoctct 

y u y k> y ^ V- 


1 

tflcccaacat 

^ y v \* w y y w w> 


5820 


/""• O ^3 -f" J3 /•*• /~i /"» /~t a 

octet l ci o y y y ci 


uctct l. ci y uy uci 


f nap 24 ^t* anra 

I 


1 ( 

naarfft asp 


ay Lyoi.ocico 


' i 

ct ^ L.yycicici ct w 




/-t 4— *f-^ 4" /-» /™i n 

y LtLt i. t.yy y 


yuy daadL.LC 


+- /~> jri /*r ■f" ^ +* 
LLaayydUL»t 


i_dov«yUrL.y l> l> 


yoya L. ^ o ct y i_ 


t" rria I" n^aar 
LU^Q l. y LOaU 




/~* y> —\ 4— /"—i h ^» /T 

CCaCtCyLyC 


aCCCadCiya 


CCL CCayCdL 


LLLLLdULLL 




ll Lyy y l vjct(j 


TJ \J VJ 


caaaaacayg 


aaggcaaaan 


^■f ^\ Tl "T* ^ 

geegcaaaaa 


n n a 3 a /^r 

agyyaa taag 


yyCyoCdCyy 


3 O ^ "f - 4" A*f 3 TH 

adaCyi tyad 


DUDU 


tact ca tact 


ct t ccttt t t 


caatattat t 


gaagcat tt a 


t cagggtt a t 


tytctcatga 


DlZU 


geggatacat 


atittgaatgt 


atttagaaaa 

i 


acaaacadat 


agggg u t ccg 


cgcacatLLC 


DlOU 


cccgaaaagt 


gccacctgta 


4— ^« A* * 

u gegg tgtga 


aataccgcac 


agar gcgtiaa 


ggagaaaaca 




ccgcatcagg 


cgaaattgta 


aacgetaata 


4* 4* 4~ 4* 4" ^ ^ *^ 

tt Ltyt taad 


^» ^» ^* 4^ 4 — *~* 

dULCyCytta 


adiai l i yt t 




— «l ** 4» -^l rf*^ 

aaciccagctc 


atctr.Ltaac 


caanaggccg 


*** ^ ~\ 

aadlCyyCaa 


|— ■ 4— 4— 

aa CCCCLUat 


add LCddaay, 


c fin 


aatagaccga 


gat agggttg 


agtyuugnuc 


CayLLr.yydd 


f-^ -^1 ^ ~} Mr /» - 

Cddyay LCLd 


LkaLlaaaya 




acgtggactc 


caaegtcaaa 


gggcgaaaaa 


ccgtctatca 


gggcgatggc 


ccactacgtgv 


6480 


aaccatcacc 


caaatcaagt 


tttttgcggt 


cgaggtgccg 


t aaagctcta 


aateggaace 


6540 


ctaaagggag 


cccccgattt 


agagcttgac 


ggggaaagee 


ggcgaacgtg 


gcgagaaagg. 


6600 


aagggaagaa 


agegaaagga 


gegggegcta 


gggcgctggc 


aagt gt ageg 


gtcacgctgc 


6660 


gcgtaaccac 


cacacccgcc 


gcgcttaa 








6688 



<210> 93 
<211> 8503 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Recombinant 
pAN004 plasmid 

<400> 93 

gaattttgcg gccgcttcga aagctgtaat ataaaaacct tcttcaacta aeggggcagg 60 
ttagtgacat tagaaaaccg actgtaaaaa gtacagtegg cattatctca tattataaaa 120 
gecagtcatt aggectatet gacaattcct gaatagagtt cataaacaat ectgeatgat 180 
aaccatcaca aacagaatga tgtacctgta aagatagegg taaatatatt gaattacctt 240 
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'tattaatgaa tttt'cctgct gtaataatgg gtagaaggta attactatta tt'attga'tiati 30'0 

ttaagttaaa cccagtaaat gaagtccatg gaataataga' aacjagaaaaa gcattttcag 360 

gtataggtgt tttgggaaac aatttccccg aaccattata tttctctaca tcagaaaggt 420 

ataaatcata aaactctttg aagtcattct ttacaggagt ccaaatacca gagaatgttt 480 

1 tagatacacc atcaaaaatt gtata'aagtg gctctaactt atcccaataa cctaactctc 540 



cgtcgctatt 

i 


gtaaccagtt 


ctaaaagctg 


tat ttgagtt 


tatcaccctt 


gtcactaaga 


600 


aaataaatgc 


agggtaaaat 


ttatatcctt 


cttgttttat 


gtttcggtat 


aa.aacactaa 

1 r . 


660 


tatcaatttc 


tgtggttata 


i 

ctaaaagtcg 


tttgttggtt 


caaataatga 


ttaaatatct 


720 


cttttctctt 


ccaattgtct 


aaatcaattt 

i < 


tattaaagtt 


i 

catttgatat 


gcctcctaaa 


780 


tttttatcta 


aagtgaattt 


■ 

aggaggct ta 


cttgtctgct 


ttcttcatta 


gaatcaatcc 


840 


ttttttaaaa 


gtcaatatta 


ctgtakcata 


aatatatatt 


ttaaaaatat 


cccactttat 


900 


ccaattttcg 


tttgttgaac 


taatgggtgc 


tttagttgaa 


gaat aaagac 


1 

cacattaaaa 


960 


aatgtggtct 


tttgtgtttt 

* * 


tttaaagga t 


ttgagcgtag 


cgaaaaatcc 


ttttctttct 


1020 


tatcttga ta 


ataagggtaa 


ctattgaa t t 


cggtaccaag 


ag tttgtaga 


aacgcaaaaa 


1080 


ggccatccgt 


caggatggcc 


t tctgcttaa 


tttgatgcct 


ggcagtttat 


ggcgggcgtc 


1140 


ctgcccgcca 


ccctccgggc 


cgttgcttcg 


caacgt tcaa 


atccgctccc 


ggcggatttg; 


1200 


tcctactcag 


gagagcgttc 


accgacaaac 


aacagataaa 


acgaaaggcc 


cagtctt tcg^> 


1260 


actgagcctt 


tcgttttatt 


tgatgcctgg 


cagttcccta 


ctctcgcatg 


gggagacccc 


1320 


acactaccat 


cggcgctacg 


gcgtttcact 


tctgagttcg 


gcatggggtc 


aggtgggacc 


1380 


accgcgctac 


tgccgccagg 


caaattct gt 


tt tatcagac 


cgcttctgcg 


ttctgattta 


1440 


atctgtat ca 


ggctgaaaat 


cttctctcat 


ccgccaaaac 


aggatcctac 


ggaaatggag 


1500 


cggcaaaacc 


gttttactct 


caaaatctta 


aaagaaaacc 


cccgataaag 


ggggcttttc 


1560 


ttctacaaaa 


ttgtacgggc 


tggttcgttc 


cccagcattt 


gttcaatttt 


gttttgatca 


1620 


ttcagaacag 


ccactttcgg 


ctcatggctt 


gccgcttctt 


gatcagacat 


cattttgtag 


1680 


gaaataataa 


tgaccttatc 


tccttcctgc 


acaaggcgtg 


cggctgcacc 


gtttaagcat 


1740 


atgacgccgc 


ttccccgttt 


accaggaata 


atatacgttt 


caagacgtgc 


tccattatta 


1800 


ttattcacaa 


tttgtacttt 


ttcattagga 


agcattccca 


cagcatcaat 


gagatcttca 


1860 


tcaattgtaa 


tgcttcccac 


atagttcagg 


tttgcttccg 


taacagttgc 


cctgtgaagt 


1920 


ttgccgctca 


tcattgttcg 


atacatatta 


tattctctcc 


atttctcgaa 


tatcaataat 


1980 
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gatattatct 


attaaacgcg 

i 


agcaatttca 


i 

t,tcacaggct 


tccgctagtc 


gtttcaatga 


accggcttgg 


acaagttccg 

1 


tctttcctca 


gctgttaagt 


gaccgt atcg 


acaggaacca 


aacagctacc 


tgctgcgcat 


gaaaagct tc 


gtcagtacga 


taacacgtct 


gtgcgtcttt 


.atgagcaitct 


ggcgtaaaaa 


ctcaatatcg 


cgcggatatg 


aaaaatactc 


ataataacgg 


cccctcatgc 


agaaacccca 


t tgttttatg 


gcttctttca 


cgtacaagcc 


gtcaagcact 


cacgatgtct 


tacatcctga 


gcgtatattg 


ctttacaaat 


aaacgagaac 


ttgtccgtcc 


gcgtctcggc 


aattttggct 


ctcctgcttc 


ttcgcatttt 


tgccctgtac 


tttatagccg 


t gactact gg 


aatgcctcca 


cctcaagctt 


cagtgcgtca 


tatcttcctt 


agacaggtga 


gcgcacccct 


tttaacggct 



ttgaatcaag gccgaggacg 
ccgcttgttc agcaagttta 
cttcagactc cttcattttt 
gagcgtcctg ctgttgttaa 
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1 

cttttgaaaa 

1 i 


* i 

* 

ageaactgea 

BP* ^ 


cgagttccgg 


1 f . 

ataggaataa 

^ BP* 

1 


tatcttttgc 


1 

agcttttatc 


cacttgtttg 

i 


aaqggcccga 


atacattgcg 


aga'gcttttg 

i 


att caatatc 


cat gaaqaaq 


cttttaaacc, 


gaaataggca 


tcacaacccc 


at caaaat qt 

^4 ^bp ^bp» ^p"f vn- ^b*1 ^b> w 

t 


ct acatgaat 


cgt gacattc 


oaatatcaac 


tccoacattt 

^M** ^B" ^PJ V-J >BP* ^P* 


cttcaaaatc 


ttcattaaaci 


catcattttc 


t tgt ettget 


tcgtcggaac 


aaatccgatt 


actatoaaa t 


atcaotaatc 

%p* ^p» V4 ^ J * ^Pl ^b^ 


atctaattca 


tt t qaaaqqa 

*bp> ^* ^pj ^pj ^»pp 


a cat at ccoc 

^ta^ ^BP" X-BB- ^BP ^BB^ ^BB"" ^BB* 


t aat tqctqt 


ttaaatattc 


tctcaacacc 


gctttcacac 


cagccccgat 


qtgaqttctq 


ccggcacaca 


atactgtctt 


ctattaattt 


cccagtacgc 


cgactgactg 

ZJ -J -J 


agcgtcaatg 


cgcgaatgga 


get ccgcttt 


cctgaacgat 

BPF 


taagacataa 


aeggcat ate 


tttgtatgat 


ggatcatgtc 

Br BBr 


accattccaa 


gtgaatcacc 


getgeeggat 


aatcataagc 


agaaaatcca 


gttttgtttt 


gattattata 


ccacaccttg 
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acagegagaa tcatctttcc 2040 
agctctacat agftctatggt 2100 
accgcttcag gatctctttc 2160 
tacagcttag gcgcttcttt 2220 
gctaagcqg.t cttcctctct 2280 
tegctgatta acccatcaac 2340 
egagtegget tgactagatt 2400 
ccttctcttg agcgcccgca 2460 

ttttcacegg gatacatatc 2520 

i 

tetgeaagag ctgcatcccg 2580 
ccgaattgtg caggattcac 2640 
ttgtctgcta aggttaaatg 2700 
gacttgccct ctgaatggta 2760 
tgtctcatct tattttcccc 2820 
atgcttttgt tcagggaaag 2880 
ttcgatggtt teatcaatge* 2940 
gtggccgata atatcatgat. 3000 
tccaatgacc ggtatgetta 3060 
ttccagcaca agcatcatag 3120 
tttggcgctt tgttcgtctt 3180 
cggtgtcaaa cctaagtgac 3240 
ttcaaacacg ccttctccgc 3300 
agccgctgca tttttcagcg 3360 
tgtcacaata aaggtattcg 3420 
cgcaactgtc acaccgacag 3480 
gactaaaatc atgtcaactc 3540 
ggtcagcatg acaateggtt 3600 
catgttttct cctccttcta 3660 
tagataaagt caacaacttt 3720 
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ttgcaaaatt 


i 

• i 

tttcaggaat 


i 

tttagcagag 


ccgctcttgt 


gctgttagga 


tatctttctt 


ttggttaaaa 


ggaaacaaaa 


agaccgtttt 


tacagtcaca 


.... 1 
gccac^tttg 


caaaaaccgg 


gtcgacaagc 


ttcgcgaagc 


ggccgcaaaa 

\ • i 


ctgggaaaac 


cctggcgtta 

i 


cccaactt aa 


ctggcgtaat 


agcgaagagg 

i 


cccgcaccga 


tggcgaatgg 


cgcctgatgc 


ggtattttct 


catatggtgc 


actctcagta 


i 

caatctgctc 


cccgccaaca 


cccgctgact 


atgcttgtaa 


aaaggggacc 


tctagggtcc 


ccaattaatt 


aggtcatcca 


ccggatcagc 


ttagtaaagc 


t tacttcgcc 


aactattgcg 


ataacaagaa 


tttgtgtagg 


gcttattatg 


cacgcttaaa 


tggctgtgag 


caattatgtg 


cttagtgcat 


cggcgtcggc 


ttgaacgaat 


tgttagacat 


t t cacgt agt 


ggacaaat tc 


ttccaactga 


tgtccaagat 


aagcctgtct 


agcttcaagt 


tccattgccc 


agtcggcagc 


gacatccttc 


caaatgcggg 


acaacgtaag 


cactacattt 


tt ccatagcg 


ttaaggtttc 


atttagcgcc 


aagagttcct 


ccgccgctgg 


acctaccaag 


aagatagcca 


gatcaatgt c 


gatcgtggct 


cgctgccatt 


ctccaaattg 


cagttcgcgc 


tcgtgcacaa 


caatggtgac 


ttctacagcg 


gaagtttcca 


aaaggt cgtt 


gatcaaagct 


accgtaacca 


gcaaatcaat 


atcactgtgt 


tacaaatgta 


cggccagcaa 


cgtcggttcg 


gatagttgag 


tcgatacttc 


ggcgatcacc 
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gttgttctgg 

1 i 


atgtagaaca 


aaacatcttt 


3780 


ggaagctagg 

i 


taggcctcga 


gbtatggcag 


3840 


cacacaaaac 


ggtctttttc 


gatttctttt 


3900 


acagcttcat: 


gccttataac 

■ ■ i 


tgctgtttcg 


3960 


ttcactggcc 


i 

gtcgttttac 


aacgtcgtga 


4020 


tcgcct tgca 

» 


gcacatcccc 


ctttcgccag 


4080 


tcgcccttcc 


i 

caacagt tgc 


gcagcctgaa 


4140 


ccttacgcat 


ctgtgcggta 


tttcacaccg 


4200 


tgatgccgca 


tagttaagcc 


agccccgaca 

i 


4260 


accgttttgt 


gaaaaaattt 


ttaaaataaa 


4320 


agtaatataa 


tctattaaag 


gtcattcaaa 


4380 


cctcgctaga 


ttttaatgcg 


gatgttgcga 


4440 

i 


aaagccagcc 


tttcatgata 


tatctcccaa 


4500 


aataataaaa 


gcagacttga 


cctgatagtt 


4560 


ctaacgcttg 


agttaagccg 


cgccgcgaag 


4620 


tatt tgccga 


ctaccttggt 


gatctcgcct 


4 680 


tctgcgcgcg 


aggccaagcg 


atcttcttct. 


4740 


atgacgggct 


gatactgggc 


cggcaggcgc 


4800 


ggcgcgattt 


tgccggttac 


tgcgctgtac 


4860 


cgctcatcgc 


cagcccagtc 


gggcggcgag 


4920 


tcaaatagat 


cctgttcagg 


aaccggatca 


4980 


gcaacgctat 


gttctcttgc 


ttttgtcagc 


5040 


ggct cgaaga 


t acctgcaag 


aatgtcattg 


5100 


ttagctggat 


aacgccacgg 


aatgatgtcg 


5160 


cggagaatct 


cgctctctcc 


aggggaagcc 


5220 


cgccgcgtt g 


tttcatcaag 


ccttacggtc 


5280 


ggcttcaggc 


cgccatccac 


tgcggagccg 


5340 


agatggcgct 


cgatgacgcc 


aactacctct 


5400 


gcttccctca 


tgatgtttaa 


ctttgtttta 


5460 
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gggcgactgc cctgctgcgt aacatcgttg ctgctccata acatcaaaca tcga'cccacgj 5520 

i 

gcgtaacgcg cttgctgctt ggatgcccga ggcatagact gtaccccaaa aaaacagtca 5580 

taacaagcca tgaaaaccgc cactgcgccg ttaccaccgc tgcgttcggt caaggttctg 5640 

t 

gaccagttgc gtgagcgcat acgctacttg , cattacagct tacgaaccga acaggcttat 5700 

gtccactggg ttcgtgcctt catccgtttc cacggtgtgc gtcacccg'gc aaccttgggc 5760 

. * < 

agcagcgaag tcgaggcatt tctgtcctgg ctggcgaacg agcgcaaggt ttcggtctcc 5820 
acgcatcgtc aggcattggc ggccttgctg ttcttctacg gcaaggtgct gtgcacggat 5880 

i • " 1 " • 

ctgccctggc ttcaggagat cggaagacct cggccgtcgc ggcgcttgcc ggtggtgctg 5940 
accccggatg aagtggttcg catcctcggt tttctggaag gcgagcatcg tttgttcgcc 6000 

i 

cagcttctgt atggaacggg catgcggatc agtgagggtt tgcaactgcg ggtcaaggat 6060 
ctggatttcg atcacggcac gatcatcgtg cgggagggca agggctccaa ggatcgggcc 6120 

I ! 
| 

i 

ttgatgttac ccgagagctt ggcacccagc ctgcgcgagc aggggaattg atccggtgga 6180 

i 

tgaccttttg aatgaccttt aatagattat attactaatt aattggggac cctagaggtc 6240 
ccctttttta ttttaaaaat tttttcacaa aacggtttac aagcataacg ggttttgctg 6300 
cccqcaaacg ggctgttctg gtgttgctag tttgttatca gaatcgcaga tccggcttca 6360 
ggtttgccgg ctgaaagcgc tatttcttcc agaattgcca tgattttttc cccacgggag/ 6420 
gcgtcactgg ctcccgtgtt gtcggcagct ttgattcgat aagcagcatc gcctgtttca.- 6480 
ggctgtctat gtgtgactgt tgagctgtaa caagttgtct caggtgttca atttcatgtt 6540 
ctagttgctt tgttttactg gtttcacctg ttctattagg tgttacatgc tgttcatctg 6600 
ttacattgtc gatctgttca tggtgaacag ctttaaatgc accaaaaact cgtaaaagct 6660 
ctgatgtatc tatctttttt acaccgtttt catctgtgca tatggacagt tttccctttg 6720 
atatctaacg gtgaacagtt gttctacttt tgtttgttag tcttgatgct tcactgatag 6780 
atacaagagc cataagaacc tcagatcctt ccgtatttag ccagtatgtt ctctagtgtg 6840 
gttcgttgtt tttgcgtgag ccatgagaac gaaccattga gatcatgctt actttgcatg 6900 
tcactcaaaa attttgcctc aaaactggtg agctgaattt ttgcagttaa agcatcgtgt 6960 
agtgtttttc ttagtccgtt acgtaggtag gaatctgatg taatggttgt tggtattttg 7020 
tcaccattca tttttatctg gttgttctca agttcggtta cgagatccat ttgtctatct 7080 
agttcaactt ggaaaatcaa cgtatcagtc gggcggcctc gcttatcaac caccaatttc 7140 
atattgctgt aagtgtttaa atctttactt attggtttca aaacccattg gttaagcctt 7200 
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ttaaactcat ggtagttatt ttcaagcatt aacatgaact taaattpat'c aaggctaatc 7260 

1 i 

< • i 

tctatatttg ccttgtgagt tttcttttgt gttagttctt ttaataacca ctcataaatc 7320 
ctcatagagt atttgttttc aaaagactta acatgttcca gattatattt tatgaatttt 7380 
tttaactgga aaagataagg caatatctct ■ tcactaaaaa ctaattctaa tttttcgctt 7440 
gagaacttgg catagtttgt ccactggaaa atctcaaagc ctttaacc'aa aggattcctg 7500 

atttccacag ttctcgtcat cagctctctg gttgctttag ctaatacacc ataagcattt 7560 

i '" 

tccctactga tgttcatcat ctgagcgtat' tggttataag tgaacgatac cgtccgttct 7620 

i 

ttccttgtag ggtt'ttcaat cgtggggttg agtagtgcca cacagcataa aattagcttg 7680 

I 

I 

gtttcatgct ccgttaagtc atagcgacta atcgctagtt catttgcttt gaaaacaact 11 AO 
aattcagaca tacatctcaa ttggtctagg tgattttaat cactatacca attgagatgg 7800 
gctagtca'at gataattact agtccttttc ctttgagttg tgggtatctg taaattctgc 7860 

r 

tagacctttg ctggaaaact tgtaaattct gctagaccct ctgtaaattc cgctagacct 7920 

ttgtgtgttt tttttgttta tattcaagtg gttataattt atagaataaa gaaagaataa .7980 

aaaaagataa aaagaataga tcccagccct gtgtataact cactacttta gtcaqttccg 8040 

cagtattaca aaaggatgtc gcaaacgctg t-ttgctcctc tacaaaacag accttaaaac • 8100 

cctaaaggct taagtagcac cctcgcaagc tcgggcaaat cgctgaatat tcctt ttgtc* 3160 

tccgaccatc aggcacctga gtcgotgtct ttttcgtgac attcagttcg ctgcgctcao 8220 

ggctctggca gtgaatgggg gtaaatggca ctacaggcgc cttttatgga ttcatgcaag 8280 

gaaactaccc ataatacaag aaaagcccgt cacgggcttc tcagggcgtt ttatggcggg 8340 

tctgctatgt ggtgctatct gactttttgc tgttcagcag ttcctgccct ctgattttcc 8400 

agtctgacca cttcggatta tcccgtgaca ggtcattcag actggctaat gcacccagta 8460 

aggcagcggt atcatcaaca ggcttacccg tcttactgtc aac 8503 

<210> 94 
<211> 7381 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : recombinant 
pAN006 plasmid 

<400> 94 

ttgcggccgc ttcgaaagct gtaatataaa aaccttcttc aactaacggg gcaggttagt 60 
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gacattagaa aaccgactgt aaaaagtaca gtcggcatta tctcatatta taaaagccag 120 
tcattaggcc tatctgacaa ttcctgaata gagttcataa acaatcctgc atgataacca 180 
tcacaaacag aatgatgtac ctgtaaagat agcggtaaat , atattgaatt acctttatta 240 
atgaatttt,c ctgctgtaat aatgggtaga aggtaattac tattattatt gatatttaag 300 

ttaaacccag taaatgaagt ccatggaata atagaaagag. aaaaagcatt ttcaggtata 360 

' 1 ' 1 t 

ggtgttttgg gaaacaattt ccccgaacca ttatatttct ctacatcaga aaggtataaa 420 

i . ■ • 

tcataaaact ctttgaagtc attct'ttaca ggagtccaaa taccagagaa tgttttagat 480 
acaccatcaa aaattgtata aagtggctct aacttatccc aataacctaa ctct<bcgtcg 540 

] 1 

ctattgtaac cagttctaaa agctgtattt gagtttatca cccttgtcac taagaaaata 600 
aatgcagggt aaaatttata tccttcttgt tttatgtttc ggtataaaac actaatatca 660 
atttctgtgg ttatactaaa agtcgtttgt tggttcaaat aatgattaaa tatctctttt 720 

ctcttccaat tgtctaaatc aattttatta aagttcattt gatatgcctc ctaaat tttt 780 

i i 

atctaaagtg aatttaggag gcttact.tgt ctgctttctt cattagaatc aatccttttt 840 

taaaagtcaa tattactgta acataaatat atattttaaa aatatcccac tttatccaat 900 

tttcgtttgt tgaactaatg ggtgctttag ttgaagaata aagaccacat taaaaaatgt 960 
ggtcttttgt gtttttttaa aggatttgag cgtagcgaaa aatccttttc tttcttatct - * 1020 

tgataataag ggtaactatt gaattcggta ccaagagttt gtagaaacgc aaaaaggcca 1080 

tccgtcagga tggccttctg cttaatttga tgcctggcag tttatggcgg gcgtcctgcc 1140 

cgccaccctc cgggccgttg cttcgcaacg ttcaaatccg ctcccggcgg atttgtccta 1200 

ctcaggagag cgttcaccga caaacaacag ataaaacgaa aggcccagtc tttcgactga 1260 

gcctttcgtt ttatttgatg cctggcagtt ccctactctc gcatggggag accccacact 1320 

accatcggcg ctacggcgtt tcacttctga gttcggcatg gggtcaggtg ggaccaccgc 1380 

gctactgccg ccaggcaaat tctgttttat cagaccgctt ctgcgtitctg atttaatctg 1440 

tatcaggctg aaaatcttct ctcatccgcc aaaacaggat cctacggaaa tggagcggca 1500 

aaaccgtttt actctcaaaa tcttaaaaga aaacccccga taaagggggc ttttcttcta 1560 

caaaattgta cgggctggtt cgttccccag catttgttca attttgtttt gatcattcag 1620 

aacagccact ttcggctcat ggcttgccgc ttcttgatca gacatcattt tgtaggaaat 1680 

aataatgacc ttatctcctt cctgcacaag gcgtgcggct gcaccgttta agcatatgac 1740 

gccgcttccc cgtttaccag gaataatata cgtttcaaga cgtgctccat tattattatt 1800 
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cacaatt tqt 


actttttcat 


taggaagcat 


tgtaatgctt 

* 


cccacatagt 


1 

tcagqtttgc 


gctcatcatt 


gttcgataca 


1 

tattatattc 


t atctattaa 


acqegctttt 


gaaaaaqcaa 


t tt cat t cac 


aqqct cqaot 


i 

teeggatagg 


t agt cgtttc 


aatgatatct 


tt tacaqctt 


cttaaacaaa 


ttccgcactt 


I 

Qtttaaaaaa 

^3 ^ *j 


cctcaoctat 


taaotataca 


1 1 acaaaaac 


tatcaacaaa 

*™* *■» V»*J V* V»* 


aaccaat tea 


atatccatoa 


ctacct~acta 


cacatctttt 


aaacraaaat 


oct teat caa 


t accia t coca 

\^ W 1^ 


acccca t caa 


catctataca 


tctttctaca 


! . 

toaatcotaa 

\ 


cat ftaocat 


aaasaaaata 


tcaactccao 


t" a^cacacaa 

^ V* w Vn- \A >■» 


atataettca 


aaatcttcat 




Q d U u V-* \^ 


ttttcttotc 


rataraoaaa 


Vp* Vm» V-,r w \/ ^ \— 


ooaacaaate 

u ci Gk a ok w 


ttataacttc 


tt t caa c tat 


aaaatatcaa 


a aaccat caa 


acact at eta 


at teat" 1 1 a a 


tgt cttacat 


cct aaacat a 


t cegctgat t 


t attgettta 


caaat 1 1 agg 


t gtt ct ctca 


agaacttgtc 


cgtccgcttt 


cacaccagcc 


t eggcaattt 


tggctgtgag 


t tctgccgqc 


gcttcttcgc 


attttatact 


gtcttctatt 


tgtactttat 


agccgcccag 


tacgccgact 


actggaatgc 


ctccaagcgt 


caatgegega 


agcttcagtg 


cgtcagctcc 


gctttcctga 


tccttagaca 


ggtgat aaga 


cataaaegge 


ccccttttaa 


eggcttttgt 


atgatggatc 


teaaggcega 


ggacgaccat 


tccaagtgaa 
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tgttcagcaa gtttagctgc cggataatca taagcggtca gcatgacaat cggttcttca 3600 

gactccttca tttttagaaa atccagttt't" gttttcatgt tttctcctcc tctagagcgt 3660 

cctgctgttg ttaagattat tataccacac cttgtagata aagtcaacaa ctttttgcaa 3720 

aatttttcag gaattttagc agaggttgtt ctggatgtag aacaaaacat ctttccgctc 3780 

ttgtgctgtt aggatatctt tcttggaagc taggtaggcc tcgagttatg gcagttggtt 3840 

i 

aaaaggaaac aaaaagaccg ttttcacac& aaacggtctt tttcgatttc tttttacagt 3900 

cacagccact tttgcaaaaa ccggacagct tcatgcctta taactgctgt ttcggtcgac 3960 

i 

ctgcaggcat gcaagcttcg cgaagcggcc gccgacgcga ggctggatgg ccttccccat 4020 

tatgattctc ctcgcttccg gcggcatcgg gatgcccgcg ttgcaggcca tgctgtccag 4080 

gcaggtagat gacgaccatc agggacagct tcaaggatcg ctcgcggctc ttaccagcct 4140 

aacttcgatc actggaccgc tgatcgtcac ggcgatttat gccgcctcgg cgagcacatg ^200 

gaacgggttg gcatggattg taggcgccgc cctatacctt gtctgcctcc ccgcgttgcg 4260 

tcgcggtgca tggagccggg ccacctcgac ctgaatggaa gccggcggca cctcgctaac 4320 

ggattcacca ctccaagaat tggagccaat caattcttgc ggagaactgt gaatgcgcaa 4380 

accaaccctt ggcagaacat atccatcgcg tccgccatct ccagcagccg cacgcggcgc 4 440 

atctcgggca gcgttgggtc ctggccacgg gtgcgcatga tcgtgctcct gtcgttgagg*. 4500 

acccggctag gctggcgggg ttgccttact ggttagcaga atgaatcacc gatacgcgag 4 560 

cgaacgtgaa gcgactgctg ctgcaaaacg tctgcgacct gagcaacaac atgaatggtc . 4 620 

ttcggtttcc gtgtttcgta aagtctggaa acgcggaagt cagcgccctg caccattatg 4680 

ttccggatct gcatcgcagg atgctgctgg ctaccctgtg gaacacctac atctgtatta 4740 

acgaagcgct ggcattgacc ctgagtgatt tttctctggt cccgccgcat ccataccgcc 4800 

agttgtttac cctcacaacg ttccagtaac cgggcatgtt catcatcagt aacccgtatc 4860 

gtgagcatcc tctctcgttt catcggtatc attaccccca tgaacagaaa ttccccctta 4920 

cacggaggca tcaagtgacc aaacaggaaa aaaccgccct taacatggcc cgctttatca 4 980 

gaagccagac attaacgctt ctggagaaac tcaacgagct ggacgcggat gaacaggcag 5040 

acatctgtga atcgcttcac gaccacgctg atgagcttta ccgcagctgc ctcgcgcgtt 5100 

tcggtgatga cggtgaaaac ctctgacaca tgcagctccc ggagacggtc acagcttgtc 5160 

tgtaagcgga tgccgggagc agacaagccc gtcagggcgc gtcagcgggt gttggcgggt 5220 

gtcggggcgc agccatgacc cagtcacgta gcgatagcgg agtgtatact ggcttaacta 5280 
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tgcggcatca gagcagattg tactgagagt gcaccatatg t cggtgtgaaa t.accgcacag 5340 

i 1 i • ■ | ' 

atgcgtaagg agaaaatacc gcatcaggcg ctcttccgct tcctcgctca ctgactcgct 54 00 

gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 54 60 

atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 5520 

caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 5580 

gpatcacaaa aatcgacgct caagtcaga^ gtggcgaaac ccgacaggac tataaagata 5640 

ccaggcgttt ccc'cctggaa gctccctcgt gcgctct'cct: gttccgaccc tgccgcttac 5700 

* .i ' ■ * 

cggatacctg tccgcctttc tcccttcggg aagc'gtggcg ctttctcata gctcacgctg 5760 

taggtatctc agttcggtgt aggtcgttq'g ctccaagctg ggctgtgtgc acgaaccccc 5820 

cgttcag^cc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 5880 

acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 5940 

aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacac.ta gaaggacagt 6000 

atttqgtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 1 6060 

atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 6120 

gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 6180 

gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 6240* 

ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 6300 

ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 6360 

tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt 6420 

accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt 6480 

atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc 6540 

cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa 6600 

tagtttgcgc aacgttgttg ccattgctgc aggcatcgtg gtgtcacgct cgtcgtttgg 6660 

tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 6720 

gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 6780 

agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt 6840 

aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg 6900 

gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac 6960 

tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 7020 
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gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 7080, 

. . ■ 1 , ■ i ' 

, • i 

tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 7140 

> 'iii 

aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 7200 

i 

i 

catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 7260 

acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct aagaaaccat 7320 

1 i 

tattatcatg acattaacct ataaapatag gcgtatcacg aggccctttd gtcttcaaga 7380 

7381 
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3. 1 1 Claims Nos.: ' J 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
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1 * i 

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows; 

1. Claims: 1-6, 12, 13, 24, 26-28, 33-35, 48, 51, 54-64, 71;76, 78-86, ' 

92,93,95,97-100,102,103 (all partially) 

i 

t 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., in a manner 
independent of precursor feed, comprising culturing a , 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses a 
ketopantoate hydroxymethyl transferase-encoding gene, e.g. 
the panB gene, e.g., from Bacillus, under conditions such 
that said panto- compound is produced, and possibly further 
recovering the compound:. A recombinant mi croorgani sm (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus (e.g!* 
Bacillus subtilis), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) which overexpresses a Bacillus 
(subtil is) ketopantoate hydroxymethyl transferase-encoding 
gene. A recombinant vector encoding a Bacillus (subtilis) 
ketopantoate hydroxymethyl transferase-encoding gene operably 
linked to regulatory sequences, e.g., comprising a nucleic 
acid sequence according to SEQ ID N0:23 or part of SEQ ID 
N0:59. An isolated nucleic acid molecule encoding a Bacillus 
(subtilis) ketopantoate hydroxymethyl transferase,- and said 
isolated ketopantoate hydroxymethyl transferase polypeptide. 



2. Claims: 1-6,12,13,24,26-28,33-35,48,51,54-64,71,76,78-86, 

92,93,95,97-100,102,103 (all partially) 



A method of producing panthotenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., in a manner 
independent of precursor feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses a 
pantothenate synthetase-encoding gene, e.g. the panC gene, 
e.g., from Bacillus, under conditions such that said 
panto-compound is produced, and possibly further recovering 
the compound. A recombinant microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus (e.g., 
Bacillus subtilis), Corynebacterium, Lactobacillus, 
-Lactococci or Streptomyces)) which overexpresses a Bacillus 
(subtilis) pantothenate synthetase-encoding gene. A 
recombinant vector encoding a Bacillus (subtilis) 
pantothenate synthetase-encoding gene operably linked to 
regulatory sequences, e.g., comprising a nucleic acid 
sequence according to SEQ ID N0:25 or part of SEQ ID N0:59. 
An isolated nucle lecule encoding a Bacillus 

(subtilis) pantothenate synthetase, and said isolated 
pantothenate synthetase polypeptide. 
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3. Claims: 1-6,12-14,24,26-28,33-35,48,49,51,54-64,66,71,76, 

78-86,92,93,95,97-100,102,103 (all partially); 15, 
17,19,23,32,106,107 (both completely) 

i 

i 

1 * I 

A method of producing pantothenate (e.g., 2 g/L up.to 4G g/L 
at least), pantoate, or| ketopantoate, e.g., independent of 
aspartate or beta-alanine feed, comprising culturing a i . ■ . 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobaci 1 1 us,. Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, e.g., from 
Baci llus, e.g. , the ^spartate-alpha-decarboxylase-encoding 
panD gene from Bacillus "subtil is, under conditions such that 
said panto-compound is produced, and possibly further 
recovering the compound; A recombinant microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus (e.g., ' 
Bacillus subtilis), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) which overexpresses a Bacillus 
(subtilis) aspartate-alpha-decarboxylase-encoding gene. A 
recombinant vector encoding a Bacillus (subtilis) 
aspartate-alpha-decarboxylase-encoding gene operably linked 
to regulatory sequences, e.g., comprising a nucleic acid 
sequence according to SEQ ID N0:27 or part of SEQ ID N0;59. 
An isolated nucleic acid molecule encoding a Bacillus 
(subtilis) aspartate-alpha-decarboxylase, and said isolated 
aspartate-alpha-decarboxylase polypeptide. 



4. Claims: 1-6,24,26-28,33-35,48,49,51,54-64,71,76,78-87,92, 

93,95,97-100,102,103 (all partially); 7-11,65,101, 
104,105 (all completely) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), pantoate, or ketopantoate, e.g., independent of 
precursor feed, comprising culturing a microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus, 
Corynebacterium, Lactobacillus, Lactococci or Streptomyces) 
or Gram-negative) which overexpresses a ketopantoate 
reductase-encoding gene, e.g., from Bacillus, e.g., the 
ketopantoate reductase-encoding panEl gene from Bacillus 
subtilis, under conditions such that said panto-compound is 
produced, and possibly further recovering the compound. A 
recombinant microorganism (e.g., Gram-positive (e-9-» 
belonging to the genus Bacillus (e.g., Bacillus subtilis), 
Corynebacterium, Lactobacillus, Lactococci or Streptomyces)) 
which overexpresses a Bacillus (subtilis) ketopantoate 
reductase-encoding gene. A recombinant vector encoding a 
Bacillus (subtilis) ketopantoate reductase-encoding gene 
operably linked to regulatory sequences, e.g., comprising a 
nucleic acid sequence according to SEQ ID N0:29. An isolated 
nucleic acid molecule encoding a Bacillus (subtilis) 
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ketopantoate' reductase, and said isolated ketopantoate 
reductase polypeptide. 

5. Claims: 14,16,18,28,48,54-61,66,77-82,' ' ' ' 

97 (all partially); 20 t) 29 (completely) 

A method of producing pantothenate (e.g. 1 , 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of valine or , 
alpha-ketoisovalerate feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) having a deregulated 
i soleucine-val ine (ilv) pathway, wherein the microorganism 
overexpresses acetohydroxyacid synthase or is transformed 
with a vector comprising an ilvBN nucleic acid sequence or 
an alsS sequence, e.g., from Bacillus, under conditions such 
that pantothenate is produced, and possibly further 
recovering the pantothenate. Said microorganism (e.g., 
Gram-positive (e.g., belonging to the genus Bacillus (e.g., 
Bacillus subtil is), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)), and said vector. 



6. Claims: 14,16,18,28,48,54-61,66,77-82, 

97 (all partially); 21,30 (completely) 



A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of valine or 
alpha-ketoisovalerate feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) having a deregulated 
i soleucine-val ine (ilv) pathway, wherein the microorganism 
overexpresses acetohydroxyacid i someroreductase or is 
transformed with a vector comprising an ilvC nucleic acid 
sequence, e.g., from Bacillus, under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. Said microorganism (e.g., Gram-positive 
(e.g., belonging to the genus Bacillus (e.g., Bacillus 
subtilis), Corynebacterium, Lactobacillus, Lactococci or 
Streptomyces)), and said vector. 



7. Claims: 14,16,18,28,48,54-61,66,77-82,97 (partially); 22, 

31 (completely) 



A method of producing pantothenate (e.g., 2 g/L;up to 40 g/L 
at least), e.g., in a manner independent of valine or 
alpha-ketoisovalerate feed, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
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or Streptomyces) or Gram-negative) having a deregulated 
isoleucine-valine (i 1 v) pathway, wherein the microorganism 
overexpresses dihydroxyacid dehydratase or is transformed 
with a vector comprising an ilvD nucleic acid sequence* 
e.g., from Bacillus, under conditions such that pantothenate 
is produced, and possibjly further recovering the< 
pantothenate. Said microorganism (e.g., Gram-positive (e.g., 
. belonging to the genus Bacillus (e.g., Bacillus subtilis), 
Corynebacterium, Lactobacillus, Lactococci or 
Streptomyces)), and said vector. ^ i i 

i 

8. Claims: 25,28,50,54-61 (all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of precursor feed, 
comprising culturing a microorganism (e.g., Gram-positive 
(e.g., belonging to thegenus Bacillus, Corynebacterium, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negati vej 
having a mutant avtA gene under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. 



9. Claims: 25,28,50,54-61 (all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of precursor feed, 
comprising culturing a microorganism (e.g., Gram-positive 
(e.g., belonging to the genus Bacillus, Corynebacterium, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negative) 
having a mutant ilvE gene under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. 

10. Claims: 25,28,50,54-61 (all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of precursor feed, 
comprising culturing a microorganism (e.g., Gram-positive 
(e.g., belonging to the genus Bacillus, Corynebacterium, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negative) 
having a mutant ansB gene under conditions such that 
pantothenate is produced, and possibly further recovering 
the pantothenate. 



11. Claims: 25,28,50,54-61 (all partially) 

A method of producing pantothenate (e.g., 2 g/L up to 40 g/L 
at least), e.g., in a manner independent of precursor feed, 
comprising culturing a microorganism (e.g., Gram-positive 
(e.g., belonging to the genus Bacillus, Corynebacterium, 
Lactobacillus, Lactococci or Streptomyces) or Gram-negative) 
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having a mutant alsD gene under conditions such that 
pantothenate is produced, and possibly further recovering 

the pantothenate. i » 

it i ■ 1 

i 1 . . . * i 

■ , ( ■ ■ , . i 

* ♦ ■ 

12. Claims: 36,37,54-59,61 (all partially) 

i i i , 1 

A method of producing beta-alanine, comprising culturing a 
microorganism (e.g., Gram-positive (e.g. 1 , bielonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressing microorganism has 
a mutation in a nucleic acid sequence encoding ketopantoate 
hydroxymethyl transferase (panB) , and possibly further 
recovering the compound. 

13. Claims: 36,37,54-59,61 (all partially) 

A method of producing beta-alanine, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressing microorganism has 
a mutation in a nucleic acid sequence encoding ketopantoate 
reductase (panE), and possibly further recovering the 
compound. 



14. Claims: 36,37,54-59,61 (all partially) 

A method of producing beta-alanine, comprising culturing a 
microorganism (e.g., Gram-positive (e.g., belonging to the 
genus Bacillus, Corynebacterium, Lactobacillus, Lactococci 
or Streptomyces) or Gram-negative) which overexpresses an 
aspartate-alpha-decarboxylase-encoding gene, under 
conditions such that beta-alanine is produced, wherein the 
asparte-alpha-decarboxylase-overexpressi ng microorganism has 
a mutation in a nucleic acid sequence encoding pantothenate 
synthetase (panC), and possibly further recovering the 
compound. 



15. Claim : 38 (completely) 

A method of producing beta-alanine comprising contacting a 
composition comprising aspartate with an isolated Bacillus 
aspartate-alpha-decarboxylase enzyme under conditions such 
that beta-alanine is produced. 
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16. Claims: 41,44-47,51,53,54-61,69,71,72,75,78-81, 

97 (all partially); 39,43,52,67,70,74,88-91, 
108-110 (a,ll completely) 



A method for producing or for enhancing production of 
ketopantoate, pantoate, or pantothenate (e.g., 10, 20 or 40 
g/L at least), comprising culturing a mutant microorganism 
(e.g., Gram-positive (e.g., belonging tothe.genus Bacillus, 
Corynebacterium, Lactobacillus, Lactbcocci or Streptomyces) 
or Gram-negative) having a mutant pantothenate 
kinase-encoding coaX gene, . under conditions such that said 
panto-compound is produced' or that production is enhanced, 
and possibly further recovering the compound. A method fpr 
identifying compounds which modulate pantothenate kinase 
activity comprising contacting a recombinant cell expressing 
the coaX gene, possibly further comprising a mutant, coaA 
gene encoding a pantothenate kinase with reduced activity, 
with a test compound and determining, the ability of the test 
compound to modulate pantothenate kinase activity in said 
cell. A recombinant microorganism having a mutant coaX gene 
encoding a pantothenate kinase with reduced activity. A 
vector comprising a mutant coaX gene encoding a pantothenate 
kinase with reduced activity, possibly further comprising 
regulatory sequences. A recombinant microorganism comprising 
a vector comprising an isolated coaX gene (e.g., from 
Bacillus (subtil is)), and said vector, possibly further 
comprising regulatory sequences, e.g., a consti tuti vely 
active promoter. A recombinant microorganism (e.g.» 
Gram-positive (e.g., belonging to the genus Bacillus (e.g., 
Bacillus subtil is), Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces)) that overproduces a 
panto-compound having a mutation in a coaX gene that results 
in a reduced level of pantothenate kinase activity, 
resulting in a decrease in the capacity of the microorganism 
to synthesize coenzyme A. An isolated nucleic acid molecule 
comprising a (mutant) coaX gene, and an isolated 
pantothenate kinase protein encoded by a coaX gene. 



17. Claims: 41,44-47,51,53-61,69,71,72,75,78-81, 

97 (all partially); 40,42,68,73 (all completely) 



A method for producing or for enhancing production of a 
panto- compound, e.g., ketopantoate, pantoate or pantothenate 
(e.g., 10, 20 or 40 g/L at least), comprising culturing a 
mutant microorganism (e.g., Gram-positive (e.g., belonging 
to the genus Bacillus, Corynebacterium, Lactobacillus, 
Lactococci or Streptomyces) or Gram-negative) having a 
mutant pantothenate kinase-encoding coaA gene, under 
conditions such that the panto-compound is produced or that 
production is enhanced, and possibly further recovering the 
panto- compound. A recombinant microorganism having a mutant 
coaA gene encoding a pantothenate kinase with reduced 
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activity. A recombinant microorganism (e^g., Gram-positive 
(e.g., belonging to the genus Bacillus (e.g., Bacillus 
subtil is), Corynebacterium, Lactobacillus, Lactococci or 
Streptomyces) ) that overproduces a panto-compound having a 
mutation in a coaA gene that results in a reduced level of 
pantothenate kinase activity, resulting in a decrease in the, 
capacity of the microorganism to synthesize coenzyme A. A 
vector containing a (mutated) coaA gene. 



18. Claim : 94 (partially) 

A vector containing regulatory sequences comprising the 
consti tuti vely active promoter Pveg (SEQ ID N0:41). 

19. Claim : 94 (partially) j 

i ' 

A vector containing regulatory sequences comprising the 
constituti vely active promoter P15 (SEQ ID N0:39). 

20. Claim : 94 (partially) 

A vector containing regulatory sequences comprising the 
constitutively active promoter P26 (SEQ ID N0:40). 

21. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:49. 



22. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:50. 



23. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:51. 



24. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:52. 



25. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
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artificial RBS according to SEQ ID N0:53. 



26. Claim : 96 (partially) 

1 A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:54. 



27. Claim : 96 (partially) j 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:55. 



28. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:56. 



29. Claim : 96 (partially) 

A vector containing regulatory sequences comprising an 
artificial RBS according to SEQ ID N0:57. 



I.' i 
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